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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
bo first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A~ Alanine, G=Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, F^Phenyl alanine , G=Glycine, 
H-Histidine, I»Isoleucine, K»Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
Pr»Proline, 0=Glutamine, RtArginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, X=TJnknovnri, *-Stop 
Codon, /=possible nucleotide deletion, 
^possible nucleotide insertion) 








IRKLBGHHHDW7ACDFS PIX5AIXATAS YDTRVYIWDPPJNGDI LM 
BFGHLFPPPTPIFAGGANDRWVRSVSFSHDGLHVASLADDKMVR 
FWRI DKDYPVQVAPI>SNGLCCAFSTDGS VLAAGTHDGS VYFWAT 
PRQVPSLQHLCRMS IRRVMPTQEVQELP I PSKTiLEFLS YRI 


5906 


146 


2038 


REGAGSGRMASGA\YNPYIEI IEQPRQRGMRFRYKCKGRSAGSI 
PGRHSTDNNRTYPSIQIMNYYGK5KV\RITLVTK\NDPYKPHPH 
DLVGKDCRD\GYYKAKFGQB\ RRP \LFPQN\LGIRCVKKKEVKE 
A\IITR\ IKAGINPFDW*KQIjNDIEDCDIjOT 
HGNL\TTALPPV\VSS P I YDNRAPNTAFItR VCR VNKNCGSVRGG 
DE I FLLCDKVQKDD I EVRFVLNDW EAKG I F5QADVHRQVAIVPK 
TPPYCKAITEPVTVKMQI»RRPSDQBVSBSMDFRYlJ?DEKDTy 
KAWCQKTTIiLFQiaiCCtf)HVETGFRHVDQ 

QS AG I TVNFPERPRPGIiLGS IGEGRYFKKEPNL FSI ©AWREMP 
TGVSSQAESYYPSPGPISSGLSHHASMAPLPSSSWSSVAHPTPR 
SGNTNPLSS FSTRTLPSNSQG I PPFLRIPVGNDLNASNACI YNN 
ADDIVGMKASSMPSADLYGISDPNMLSNCSVNMMTTSSDSMGBT 
DNPPJ^LSMNLENPSCNSVIiDPRDLRQLHQMSSSSMSAGANSNTT 








SGIGSMQNEQLSDSFPYEFFOV 


5907 


99 


1873 


-XTLLSSmS * *NLDTKIKSQVKV/ RXGHKKIS WP YPQPAKQNGK 

KATSKVPSAPHFVHPNDHAKrtiEAELKKKWVRBMRKKOXJAAREQE 

RQKRRTTESYCQDVLRROREFEHKEEVLQELNMFPQIiDDKATRK 

AYYKKFRKWK YSDVT LEVI*DAPJ>PLG(^C PXJMBEAVLRAQGNK 

KLVLVLNKIDLVPKBWK KWLD YLRNKLPTVAFKASTQHQVKNL 

NRCSVPVDQASESM^KACFGAKNIiMRVIjGHYaUiGETO 

VQWGLPNVGKSSIiINSLKRSRACSVGAVPGITKFMQEVYLDKF 

XRDrjmPGIVPGPNSEVGTrLRNCrraVQKIiADPVTPVETIMJRG 

NLBE I SNYYGVSGFQTTBHKIiTAVAHRIiGKKKKGGL YSQEQAAK 

AVLAD WVSGKI S FYI PPPATHTLPTHLS AE 1 VKBMTEVFD I EDT 

EQANEDTMECIAlGESDELLG^ 

BNKTXVYKIGDLTGYCTNPNRHC^WAKRNVDHRPKSNSMVDVC 

SVDPJRSVLQRIMETDPLQOGQALiASALKNK^ 

SDSMMHALDLSGHADDGVGD 


5908 


247 


975 


HCGI KKRGEGSGSPSPASGGFQIjGCQI P3PSLPSEEETHPHTRA 
HTRTUlATIiTRRPPRSHSTRI*RF^PIiIX5IX^IASWK/PmBR* 
GWRRPAKAAGASLGVAATGKRGCRMSKRYI^KATKGKLLI IIFI 
VTLKGKVVSSANHHKAHH VKWTCEVVALHRCCN KW Kl KKRS C*T 
VKCSCFPGQVAGTTRAAPSCVDAS I VEQKWWCBWQPCLEGBBCK 
VLPDRKGNS CSSGNKVKTTRVTH 


5909 


1 


5002 


PAIPGSTIIWAPGSHSAARAIX3RHGSbPSQ3C]APGALCGARAPP 

S SNLRADR SMICAQARAGKNliYKNR FLGI AAMAFPSRNS QSLRR 

CECBPIRY5YNPIX)FHN^I^GGPEaX^lPPj5TsbTDLVTSDSR 

STIWRSSY^SIGHSQDLVIHWDIKEBVDAGDWIGMYLrDBVLS 

ENFI^YKNRGVNGSHRGQIIWKIDASSYFVRPETKJCFKYYHGV 

SGALRATTPSVTVKNSAAPIFKSIGADETVQGQGSRR^ISFSLS 

DFtJAMGIJCKjGMFFMPDP YI»KI S IQPGKHS I FPALPHHGQERRS K 

I IGOTVNPI WQABQFSFVSLPTDVLE IEVKDKFAKSRPI IKRFL 

GKLSMPVQRLIjEJRHAIG©RVVSY^rW5RRI*P^ 

TSSIHPDDEEISI»STEPESAQIQDSPMNNIiMESGSGBPRSEAPE 

SSESWKPEQLGEGSVPDRPGHQS I ELS RP AHEAAVITKAGDQGM 

VS VGPEGAGELLAQ VQKDIQ PAP SAEE J^AEQLDLGB B AS AIiIiLE 

DGEAPASTKEEPLEEEATTQ S RAGRBBEEKEQEREGD VS TLEQG 

EGRLQ1RASVKRKSRPCSLPVSELBTVIASACGDPETPRTHYIR 

IHTLLHSMPSAQGGSAAEERDGAEEESTLKDSSEKDGLSEVDTV 

AADPSALEEDREEPEGATPGTAHPGHSGGHFPS LANGAAQDGDT 

HPSTGSESDSSPRQGGDHSCEGC31ASCCSPSCYSSSCYSTSCYS 

SSCYSASCYSPSC^GNRFASHTRFSSVDSAKISESTVFSSQDD 

EBBEN5AFESVPD6MQS PELDPESTNPGAGPWQDELAAPSGHVBR 

SPKGLESPVAGPSNRREGECPILIINSQPVSOljPSLRPEmnr^PT 

IDEPLPPIWBARIDSUGRVFYVDHVNRTTTWQRPTAAATPIX^ 

PvSGS IQQ^QLNRRYQNIQRTIATBRSEEDSGSQSCEQAPAGGG 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence • 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide • 
UUAlanine, C=Cystaine Dispart ic Acid, E- 
Glutamic Acid, F=Phenylalanine, (^Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M-Methionine, N=Asparagine r 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T-Threonine, V= Valine. 
W=»Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 








GGGGSDSKAESSQSSLDLRRKGSLSPVNSQKITLIiLQSPAVKFI 
TNPEFFrvi^LANYSAYRVFTSSTCIiiaiMlLKVRRDARNFKRYQH 
NRDLVNFINMFADTRLELPRGWEI KTDQQGKSFFVDHNSRATTF 
IDPRI PLQNGRLPNHLTHRQHLQRLRSYSAGEASKV55RNRGASI* 
LARPGHSLVAAIRSOTQHESLF^YNDFOCVAFLRQPNIFEMLQK 

E I MS YVPLQAAFHPGYS FS PRCS P CSS PQNS PGLQRASARAPS P 

YRRDFEAKLRNFYRKLBAKGFGQGPGKIKLI IRRDHLLEGTFNQ 

VMAYSRKEU)RNKLYVTFVGBBGLDYSGPSRBFFFLIiSQEIjFNP 

YYGLFEYSANDTYTVQISPMSAFVKNHIfEWFRFSGRILG\LALI 

HQYLLDAFFlARPFYKAIiL\RLPC\D\rjSDIjBYIiDKKFHQSLQW 

MKDNNITDIIJ>LTFTVNEEVFGG^^ 

BYIBRMVKJffiVERGVVQQTKAJWRGFYE 

L V IAGTAE IDLNDWRNNTE YRGGYHDGHLV1RWFWAAVERFNNK 
QRLRLLQ FVTGTS SVPYEGFAAPPWBPMGLRRFLP * KKWGKITS 
LPPRG\HT(XQPDWDLPTVSPRrPMLYEK\IJ J TA\VBBTSTFGT 


5910 


1526 


446 


VAE FAAMEPGRTQ I KLDPR YTADLLB VliKTN YGI PSACFSQPPT 
AAQLLRALGPVEI ALTSXLTI>LALGS 1AI FLEDAVYLYKNTI.CP 
X IvKK f liLiWA&b/U?TVV5VIjCCFCT PRS LVLVEMTITS FYAVC 
FYIjLMLVIIVEGFGGKEAVLRTLROT^ 

LLTRKKLQ\R*CWA1*SNTPS*R*R* PWWAjCFSSPTASMTQQTFL 
RGACDYGSTLSSA/CSTIiLALWTLGI ISRQARLHLGEONMGAKF 
AIiFGVLLILTALQPSIFSVI»ANGGQIACS PPYSS KTR3QVMWCH 
tiuJL LtB I r 1M A VI* X KMY X KKKDI1K VG I KTr 3 3 PDXjDIjNIjKALR WM 
AWTMKGCCTH 


5911 


109 


595 


QIjPLAPC I QGKGIiKMRSPKPQS FI IRSSHSGAGM*VKNPSTPVF 
CGHRRGGAAFKVIO'TPVVGPEQRPTGXJKHMRGGVSLLSPRLECS 
GTISAHCNLRLPSSSNSPAPA5*LAGITGVCHHAQL1FVFLVET 
GFHHVGQAGLELL/NVVIHLPRPPKvTiGLQA 


5912 


924 


277 


MILNKAI^LGALAIiTTVftS^ 

SGQYSHBFDGDFJEFYVDIiERKETVWQLPbFRRFRRFDPOFALTN 
IAVTJOlNLNIVIKRSNSTAAraBVP^ 

VTSPSFPFE* *DL*TAKVEQLGAWFBPLLKHWGABIPTTL 


5913 


46 


1198 


QLRMAGAEGAAGRQSFJ^PWSLVDVXEEDBFJ^^ 

DSEKCSYSQGSVKROALYACSTCTPKGEEPAGICLACSYECT 

HKLFELYTI0ttiP^CDCGNSKFT(NLBC3^ 

FFGLYCICKRPYPDPF^EIPDEWIO^VVCEDWFHGRHLGAIPPE 
SGDFQHKVCQACMKRCS FLWAYAAQLAWIQST\GWMDWCXn!lja 
B* /DIXfflVIKPENGIfflODSTIiIffi 

CAGSSSKSDLQTVFKNESLNAESKSGOCIiQKLKAKQLIK 
YWPLNWR^ KIiCTCtJDCMKMYGDLDVLFLTO 
AOATDRSDPLMDTLSSMKKVQQVELIC/GIQ* FKD 


5914 


960 


124 


NJ^SBLPPEEALFIQVASMNQRRVDF^LASIKDMIiVAl/GGRN 

BNGAI^BVETYSPKTDSWSYVAGIiPRFTYGHAGTIYKDFVYISG 

GHDYQIGPYRKN1aLCYDHRTDWBFJlRPT4^ 

lYSIGGSDDWrESMERFDVliGVEAYSPQCNQWTRVAPLLHANSE 

SGVAVWEGR I Y I LGGYS WEOTAFSKTVQ VY DREAOKWSRGVDLP 

KAI AGGSACFI AP*SLGQRTRKRKAKARGTRTGASDPS CASWDH 

PHRHLPGTiCRPAATS 


5915 


1604 


703 


FPGRPTRPI#ICLGRRRKRARI IQAPHaiSPRPRTCPPGALQAPEA 
PASRA£GPVAVVVNGHTFX5PAPARSAPKEPPGLPRPrx;SFPCPT 
PQBDFPAlXSGPCPPRMPPSPGFSAVVLIiKGTPPPPPPGLVPprS 
KPPPGFSGIiLPSPHP\PVSPAPPPPPPQK/RPRLLPAP/PGLPS 
PREI.PGBEPSAHPVHCGLPAERRGPLQRVQEPLRGVQTGPDLRS 
PVLQKLPGPAGGEPPBGIi* *AAGPAAH 


. 5916 


256 


633 


SPRMHEIWGPWHRWEEPSLEGEWPSRIPEPSPDSTKGTSGKGCR 

TVTGAV}n*HLNHVAGIIPWVIiHS^ 

PTRil W*NO^VTADKNN* TTALLQPHQRI»\VS PRMABA 


5917 


1343 


827 


AHQIIiTYLEP/ IdiWNYNKILTVFLTKSVLBI *KFIHTPQTYR 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=&lcLnine, C=Cysteine, D^Aspartic Acid, B=* 
Glutamic Acid, F»Phenylalanine, G=Glycine, 
H~Histidine, I=Isoleucine, K^Lysine, 
L -Leucine, M^Methionine, N-Aspaxagine f 
P=»Proline, Q=Glutainine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








?*tfl3FFGIKBVYVSRRIJ?KTSP/kl^VTFLEC)AVVSKECVPVDQ 

?MFJH,LPSIJ^LA£DPVPNVRVLLAKA^ 

HLEVIBET ILALQSDRDQDVSFPAALEPKRPvNI IDTAVLBKQN 


5918 


13 


1247 


EGAQVARRRSRRQWRAGRCGRGRGGRRAERTGGRGPPGRPR PIjP 

PGPARRGRRRMBTP FYGDBALSGLGGGASGSGGTFAS PGRLFPG 

A?PTAAAGSMMKKDALTI«SIiSEQVAAAXKPAPAPASYpPA\ADG 

APSAAP PDGLLASPDLGLLKLAS PELRRL X IQSNGLVTTTPTSS 

QFLYPKVAASEEQEFABGFVKAIiBDI£^ 

AGGPSGTATGSAPPGEIAPAAAAPEAPVYA\NLSSY\AGGCRGIj 

RGGAAT\VAFAABPVPF^PPPPPGALGPRRP/RIiALGX?RRPQTV 

PDVP\SFGESP\PLSPIET\DTPRRI\KAXRKRL\RNPQIRAPK 

PASRKLGAQSRALBRKSEDPS +SPEHGSLASTASLItRSQVAQLK 

QKVLSHVNSGCQJ^LPQHQVPAY 


5919 


1 


4254 


TSVQGDSOGTPTSSO^SINMBHWISQAIHGSTTSTTSSSSTQSG 

GSGAAHRLADVMAQTHI ENHSAPPDVTTYTSEHSIQVERPQGST 

GSRTAPKYGKAKI^ETGDGVPVSSRVSAKIQQrjVNTLKRPKRPP 

IiREFPVDDFEEIJ^BVOOPDPNQPKPEGAQMLAMRGKOLGVVTNW 

PPSLEAALQRWGTISPKAPXLTTMDTNGKPLYIIjT^ 

KVAYS ILHKLGTKQEPMVRPGDRVALVF PNND PAAFMAAFYGCL 

LAEWPVP I KWLTRKDAGSQQIG FLbGSOJVWALTSDACHKG 

IJ»lCSPTGBIPQFK3GWPKLIjWF\n^KHI^KPPRDWP\pH^ 

NDTAYIBYKTCK\lX5SVIX3VTVTRTAIXTHCQAbT^ 

TI VHVLDFKKDVGLWHG II/TSVMNMMHVTS I PYSLMKVNPLSWI 

QKVCQYXAKVAWKSRDMHWAIiVAHRDQRDINLSSIjRMLIVATO 

ANPWSISSCDAFLNVFQSKGLRQBVICPCASSPEAIjTVAIRRPT 

DDSNQPPGRGVLSMHGLTYGVI RVDSEEKLS VLTVQDVGLVMPG 

AIMCSVKPIXJVPQLCRTDEIGELCVCAYAaXJTSYYGIiSGMTKNT 

FEVFAMTSSGAP I SEYPFlRTGLLGFVGPGGLVFVVGKjMDGIjMV 

VSGRRHNADD T VATALA VEPMKFVYRGRIAVFS VTVLHDE R IVT 

YAEQRPDSTERDS FQWMS RVLQAI DS IHQVGVYCLALVPANTLP 

KTPLGGIHLSF/ITCQLFIiBGSIjHPCNV^ 

PBIGPASVMVGl^V^KRrAQASGRDIiGQlBDNDQARiCFIiFLSE 
VLQWRAQTTPDHI LYTIiLNCRGAIANSLTCVQI.HKRABKIAVML 
MBRGHLQIXIDHVALVYPPGIDLIAAFYGCLYAGCWITVRPPHP 
CNIATTLPTVKMIVEVSRSACLMTTQLI 

TWPLILDTDD * PKKRPAQI COTCNPDTl^YLDFSVSTTGMLAGV 
IWSHAATSAFCRSIKLG^ELYPSP^VATCLDPYCGI^PVLWC^ 
SVYSGHQSIL1 PPSFXETNPALWJ^VSQYKVRDTFCS YSVMEL 
CTKGLGS QTES LKARGIiDLSRVRTCVVVAEKRPR IAI/FQS FS KL 
FKDI^mpRAVSTSFGCRVNIiAICIK^SGPDPTW 
HDRVPJ^VERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSH 
DGEI WVHSAHNASGYFT1 YGDE^LQSDHFNSRLS FGDTQTIWAR 
TGYLGFIJtRTEIjTDANGKRHDAIjYW 

rETSVIRAHKSVTECmVFTVrrNIJLVVVVEI^SEQByu^DLVPI.V 
TNWIiEEHYTj I VGWWVD IGVTP INSRGEKQRMHLRDG FLADQ 
LDPIYVAYNM 


5920 


1381 


1499 


QLGAVAHAGVSRI PP* LFPPLHPTFIiSIiWCLHHKLP /HPPGA5M 
VRPPWPRRPPAH I SSVRQASTQVPRTVPHTQPvVANTGTQTTGP 
SGVGCCTPGRPIiLPCKCSSAAHSTYRVQEPAVHI PGQEPLTASM 
LAAAPLHEQKQMIGE1tf»YPLIHDVHTX)^^ 
LimESPESI«AKIDEAVAVLQAHQAMEQPKAYMH 


5921 


727 


157 


^/rpnTfyJR tfCZX.VtdTM JZfTT.DITRT'DT.If DMTVB WIVCTT-tf DIf Drirwmi r 

gssvsnsddeisssdsadscdslnppttasptpts ilkrqkqlr 
rm^fdqvtvyyfarrqgftsvpscggssi^maqrh^ 

LCBFAQEQEVNHRE I LREHLKEEKLHAKKMKLTKNGTVES VBAD 

GLriJ^DVSDFJ^IDVENVF^VDDYFFljQPItPTKRR 

IDAF^KQEI^IRLSREECGCDCRLYCDPEACACSOAGIKCQVD 

RMSFPCGCSRDGCY^MAGRIEFNPIRVRTHYL^ 

Q\GAAQQPQ\*GALPDCQLQPDRSTGI,*DPSWIGSKGLSFTGKG 

AAATHLI ILRVIENRGAEGKRK 


5922 


2475 


495 


SYSNWGLFPS^IQVPRSRTONLKpXFLFYSYYEXCMEriJCG \T 
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ID 
NO; 


predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, c=Cysteine, D=Aspartic Acid, B=a* 
Glutamic Acid, ^Phenylalanine, GM3lycine, 
H=Histidine, I^Iaoleucine, KaLysine, 
L=Leucine, M=Methionine, N^Asparagine , 
P=»Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine, VoValine, 
W=Tryptophan, Y-Tyrbsine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








CLYNATQYKVCSPRNDRPDACYNPSE PAATTVFEIRTGLLLGDT 
SKI ITRTBEKE I PKQ1 TLRFnACAAINSKKLBIGCGSLK * BUS * 
RVENKYVCHESGVCKNCAYWPCVI * AT* KKNKNDSVYLQKGEAN 
PSC^GHC^LELI ITNPLDPHWKKGBRVTLGIireTGLKPQVVI 
LI KGEVHKCSP KPV FQT FYKELNLPAPEIAKKTKNIiFt^lAKNV 
IFLLNGT5 CYVRGGTTIGDRWPWBA* ELVPTDPAPDI IPX * KAE 
ASNF*VLKTS I IRQYCIAREGKDFI I PVGKPNCIGQKLYHSTTK 
TIT* *DLi^E10^FSKFSia^TA*AHABSH*DWTVPSGLY*IC 
RHRAYFRLPNKWADSCVIGTI KPS FFLLPIKMGKLLGPS VYASR 
EKKGXVIGNWKDNEWPRERIIQYYGPATWAQDGSWGYR/TP/VY 
MLNWI IRLQ^ILEII^ETGRALTVIJW^^ 
DYIAVABGXWCRKF^TNCOiQINIKX^VVKN^ 
IQVWHKFDPE5LyGKWFPAIGGFKTIjIVGVLIiVIRTCLIiLPCVL 
PliLFQMI KG rVATLVHQKTS Ai IVNYMNHYRS I SQRDSKS EDESE 
NSH 


5923 


137 


638 


QI^RROK>RFRTSIKRMHPI*RTCPlirr^ 
QQBHREIJWISLBEHQD^BiaMSKYTU^^ 
LKAHQSHSAElESQIDRICEMGEVMRKAVQVDDDQF13(IQEKIiA 
QLELENKELRELLS ISS ESLQARKENSMDTASQAI K 




274 


2146 


F^GKVKDAGAEQWISl^I^CKBSWBTQFSNHLNSLTPPTSVRRM 
PLITTVTfcLKMVARHHKKIiLCSKiu?^ 

SVCWKLKPNTSHI ISILW3QPKALVQI*ETIjAPI»TI I IQKFQTQD 

HMKFHKNIiPI^K^LTPSVPQTVIPKKTGSPBIKI^ITKTICW 

RE1JE^SLDGDLI^KVQASE\Q+NQSIESRICEKRKKSNKEDSSR 

SBFJIKSHOPKI.EPEECNRPNKRVDTVSEKPREEPVLKEGSPSS 

ANTI FCSNNGSVHff \PKPQVGDLVWSKVGTYPW WPCMVSSDPQL 

EVHTKINTRGAREYHVQFFSNTQPERAVfVHBKRWEYKG^ 

LLAFJVTKQASNHSEKQKIRKPRPQRERAQVfDIGlAHAEKALKMT 

REERIEQYTFIYIDKQPEBAr^QAKKSVASJOEVKKTRRPRSVIi 

NTQPEQTNAGBVAS S LSSTE I RRHSQRRHTSABBEEPPPVK1AW 

KT?AARK£IjPASITMHKGSLDLQKCNMS pvvkieqvfalqnatg 

DGKFIDQ FVYSTKG I GNKTE I S VRGQDRL 1 1 STPNQRNEKPTQS 

VSSPEATSGSTGSVEKKQQRRSIRl^ESEKSTEVVPKKKIKKE 

QVGFLHVBS 


5925 




1911 


MMTAF^RKATGLSPQAAQBKDGlVIVKVEEEDERDHMWG<aDSTL 
QDTPPPDPE IFRQRFRRFC^QNTFGPRKALSRLKELCHQWLRPE 
IHTKEQII^IjLVI£QFI»SII»PKEU3fVWLQE 
DI£IiDI^GO^VPGO^GPEMLARGMVPIx0PVQESSSFDLHHEAT 
QSHFKHSSRKPRLIiQSRArJPAAHrPAPPHEGSPRDOAMASALPT 
ADSQAMVKIEDMAVSLI LEEWGCQNLARRNI^RDNRQENYGSAP 
POGGENRHR^ESTS KAETSEDSASRGB TTGRSQKBFGEKPJXJE 
GKTGERQQKNPEEKTRKEKRDSGPAIGKDKKTITGBRGPREKGK 
GU3RSPSI^SNFTTPEEVP0X^KSHRCDEC»KCFTRSSSI*IRHK 
1 rHTGEKPYBC^BCGKAP\SI^S\NLVLHQRI\HTGBKPHECNE 
CGKAFSHSSNL>ILHQRIIISGEKPYBCNECGKAFS0SSD\L1 , KHQ 
RnmSEK^YECSECGKAPWRNSYT.rLHRRVHTREKPYKCTKCGK 
\AFTRSSTTiTL»HHR I HARERASEYS PASLDAFGAFLKSCV 


5926 " 


2 


233 


DRCLMUCQGSQPGSPPAT/CBPPAPPVYQAPCQSCPBPPGAtlEP 
SDSPHHTPVHPPPEHSAACPAPATCCPPPRSSMS 


5927 


4146 


1248 


KHFSKK?SQALYC^KRPASGQNSISVMPAOKITKPAAKYGIPIiA 
YXKYGDKKLHEKKRI^KHKC^QTPBKRWTGEERRKI SEEAAR 
KRRLEFIEKEKKQKDQI 1 SLMKAEQMKRQEKERLERIKRAREQG 
WRNVLSAGGSGEVKAPFIjGSGGTIAPSSFSSRGQYEHYHAIFDQ 
MQQQRAEDNEAXWKRB I YGRGLPERQKGQLAVERAKQVEEFLQR 
KREAMQNKARABG12<G I LQNIAAMYGGRPS SSRGGKPRNKEEBV 
YIiARLRQIRLQNFtffiRQQIKAKljRGBKKEANHSEGQEGSEEAC*! 
RRKK\ I ES LKAHANARAAVUCEOLERKRKEAYEREKKVWBEHriV 
AJCGVXSSDVSPPLGC3iETGGSPSKO£MRS^ 
SLTDTRETSEEMQKTNHAIS SKREILRRI«Nfifl 
LSDTFEINVHEDAKKHBKEKSVSSDRKKW^ 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C^Cysteine, D=Aspartic Acid, E=* 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=»Histidine, I^lsoleucine, K>I»ysine, 
L=Leucine, M-Methionine, N^Aspacagine, 
P=Prol ine , Q=;Glut amine , R«Ar g inine , 
$= Serine, T=Threonine , V«Valine, 
W=Tryptophan, Y=Tyrosine f X=Unknown , *=Stop 
Codon, /-possible nucleotide deletion, 
V*possible nucleotide insertion) 








I^I^mJUENT-TIRSEISPEGEinfKPLITGEKKVQCISHSINPS 

AIVDSPVBCKSPEFSEASPQMSLKLEGNLEB?DDLETEIIjQEPS 

G'mKDE\SLPCTITDVWISEKKETKBTQSADRrTIQHNEVSEDG 

VSSrVDQLSDrHIBPOTNDSQHSKCDVDKSVQPEPFFHKyVHSE 

Hl^VPQVQSVCraPEESFAFRSHSHIMKN^^ 

LFDANNPK^RTCSIJ>DLSKI.FRTLMDVP^ 

BDENIKKGPSDSBDIVFEETDTDT.QEL0A5MEQLLRBQPGEEYS 

EEKESVIiKNSDVEPTANGTDVADEDDNPSSESALNEKWHSDNSD 

GEIASECBCDSVFNHLEETjRIjELEOjFJrc 

DKDKN X E ICS KI VC^LGNEHQHLYAKILHliVMADGAYQEDNDE 


5928 


414S 


1248 


KHFSKFGSQALYQkKRPASGQNS I SVMPAQKI TKPAAKYGI PLA 
YKKYGDKKIJIEKJCPIXJKHKQAHO/rPEKRVNTGEERRKISEEAAR 
KlffiLEFrEKEKKQKIXJIISIiMKAEQMKRQEKERliERINRARBQG 
WRNVLS AGGSGRVKAP FLGSGGTIAPSSFSSRGQYEHYHAI PDQ 
MQOQRAKDNEAKWKR E I YGRGLPE RQKBQIAVERAKQ VEEFLQR 
KREAMQNKARAEGHMGILQNIiAAMYGGRPSSSRGGKPRNKEBEV 
YIJU^ LRQIRIjQHFNFJIOXII KAKIiRGEKKEANHS EGQEGSEEADM 
RRKK\ I ESLKAHANARAAVIjKEQLERKRKBA YBREKKVWEEHIjV 
AKGVKSSDVSPPLGQHETGGSPSKO^MRSVISVTSAIjKBVGVDS 
SLTDTRET3 EEMQKTNHAISSKRE I LRRLNKNLKAQEDBKGKQN 
LSDTFE INVHEDAKEHEKEKS VS SDRKKWEAGGQIiVX PUDEI/Tti 
DTSFSTTERHTVGKVIKLGPNGSPRRAWGK5 PTDS VLKILGRAB 
I^LOlBIiLEinTIRSEISPEGEKYKPLlTGEKKVQCISHEINPS 
AIVDSPVBTKSPKFSEASPOMSIiKIiEGNLEEPDDLBTEILQEPS 
GXNKDE \ SLPCTITDVW 1 SEEKETKETQSADRIT I QENEVSEDG 
VSSTVIXSI^DXHIEPGTNDSQHSKCDVDKSVQPEPFFHKVVIISE 
HLNLVPQVQS VQCS PEE S FAFRSHSHL. P P KNKNKNSLJjTGLS TG 
LFDAIWPKMLRTCSLPDI»SKLFRTIjMI>VPT^ 
BDFJWIKEGPSDSEDIVFBETDTDLQELQASMEQLLREQPGEKYS 
EEEESVLKNSDVEPTANGTDVADEDDNPS SESAMJEEWHS DNSD 
GEIASBCECDSVFNHLEEIJlIiHIiEQEM^ YEKI KAXHE 
DSDENIE ICSK I VQNT LGNEHQHt»YAKI LHI>VMADGA YQEDNDE 


5929 


..... . 3 


1558 


LDFS^rTQLPAYVA X LLFYVS RASCQDTFTAAVYEHAAILPNAT 

LTPVSREEAT4VLMNRNt»D I LEGAITSAADQC? AH I IVTPBDAIYG 

WNFNRDSLYPYI.EJDI PDPEWH I PCNNRNRFGQTPVQSRJjS cl\ 

AK1WSIYVVANIGDKKP<3)TSDPQCPPI^RYQYNTDVVF\DSQG 

KI»VARYHKQNI>FMGENQFNVP KEPBIVT FNTTFGS FG I FTCFDI 

LFHDPAVTLVKDFm^IVFPTAWMNVLPHt^VEFHSAWAMGM 

RVNFLASN IHYPSKKMTGSGI YAPNSSRAFIIYDMKTEEGKLIiLS 

QI*DSHPSHS AWNWTS YASS I EALSSGNKE PKGTVFFDEFTFVK 

LTGVAGNYWC^KI)LCairiSYKMSENIPlTEVYAIX3Aro 

GR YYLQ I CTL^CKTTNT^I^^GDSAETASTRFEMFSIjS GTFGTQ 

YVFPEVIJ^EiJQIAPGEFQVSTDGRLF3LKPTSGPVLT\^ 

LYEKDHASNASSGLTAQARIIMLIVIAPIVCSLSW 


5930 


113 


6082 


RGNCFWIVPFTimQRTGI^PBRYI^DRAVIYNPATOADVraAK 
KLVWIPS ERHGFEAAS I KEERGDEVMVEIiAENGKKAMVNKDDIQ 
K^INPPICPSKVBDMAEl>^CIlNEASVLHNLXDRYYSGLIYTYSGIiF 
CVVINPYKWLPXYSENIIEWYRGKKRHEWPPHIYAISESAYRCM 
LQDRFJXJS IL(^BSGAGKTENTKKVIQYIJUIVASSFIKGRXDH^ 
I PGE\l^RQI»LGANPILESFGNARTVDjNDNSSRFGKFlRXNFDV 
TGYIVGANIETYLIiEKSRAVRQAKDBCTFHIFYQLI^GXAGEHlT 
K3DLIaIaEGFNNYRFLSNGYIPIPG0\QDKGNFRGDPGEAr^HIMG 
FSHEEILSMLK^SSVIX3FGNXSFKKBRNTIX^MPENTVAQKL 
CHIJ^GMm^EFTRAILTPRIKVGRDYVQKAQTKEQADFAVFJUiA 
KATYE^FRWLVHRINKALDRTKRQGASFIGILDIAGPE I FBLN 
S FEQLCINYTNE!O j G£IjFNHTMP I LEQEEYQREGI EWNF EDFGL 
DUJPCIDLIERPANPPGVLAIiLDEBCWFPKATDKTFVEKLVQEQ 
GSKS KFQKPRQLKDKADFCIIITYAGKVDYKADKHI^KN^PIjND 
NVATLLHQS SDRFVAELW KDVDR I VGLDQVTGMTBTAFGSA YKT 
KJCGMFRTVGQtiYKESLTKLMATIjRin^PNFVRCI IPNHRKRAGK 
I^PHliVJ^I&CMGfl/I^IRICT^ 
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NO: 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine r D^Aspartic Acid, Ec= 
Glutamic Acid, F=Pheuyl alanine, G=Glycine, 
H^Histidine, I«Isoleucine, K=I»ysine , 
L^Leucine. ^Methionine, N=*Asparagine, 
P»Proline # 0>*G1 ut amine , R=Arginine, 
S=Serine, T~Threonine, V=Valine, 
W=Tryptophan, YsTyrosine. X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NAI PKGFMDGKQACERM I RALRLDPNLYRIGQSKI F FRAGVLAH 
LE E ER DL.K I TD I I I FFQAVCRG YLARKAFAKKQQQLS ALKVLQR 
KCAAYLKLRHWQWWRVtT T KVKPLI J QV'rRQEKELQAKUEEtiI,KVK 
HKQTKVEGEIiKEMERKHQQLLBEKNIlAEQIjQAETELFAEAEEN 
RARIJU^QELERII^IDLE^IWEEEEERNQILQNEKKKMQAIIIQ 
DLEBQLDBBBGARQKLQIjBKVTAEAKIKKMKKKTTiTiTiKIX^WS^ 
IKBKKLMKI)RlAECSSQIiAEEKKKAKNEAKIRNKOKVMISDLEE 
REKOEKTRQELKKAKRKIjDGBTTDLQDQIAEL0AQIDEIiKLQI> 
AKKEEEItfSAIJVRGDDETLHKlB^ 

kka5rnkaekqk^lsbklealhteled0 , ldttaaqqelrtkrb 
qrva klkkaeeeetknhkaqiqdmroRhatai j?keseoeeqakr 
fkam>bknkqgi*btonkeiacsvicvlqq 

QELHAKVSEGDRI^VEIiAEKASKIiQNEI^NVSTLLEEAE KKGI K 
FAKDAASLESQLQDTQEIiIjQEBTRQia^NLSS RI RQliEEEKNSLQ . 
EOOEEKEKARKNLEKQVIiALOSQLAUXKKKVDDDLGTIESLEEA 
KIOU^KDAFALSQRLERKAtAYDKLEXIT^ 
HQRQVASNEEKK^\KTOT>QLrAEEKSlSAEY7lEERDRAEAKARE 
KETKALS LARALBEALEAKEEFBRQNKQI .RADM EDEiMS S KDDVG 
KNVHELEKSKIUuVBQQV\B£MRTQI>EBIiEDBLQATE^^ 
m^AMKi\OFERDr^^DK^NEEKKRI^IKQVRELEAEEEDKRKQ 
RALAVAS KKKME IDLKDLEAQIEAANKARDEVI KQIiRKLQAQMK 
DYQRBI^EARASRDEIFAQSKKSEIOa^I^EIU)U)EELASS 
ERARRHAEQERDEI»^EITNSASGKSA1XDEKRRI*EARIAQLEE 
ELBE BC^NMEIJjNDRFRKTTLQVDTL^ 

QQLERQN KEtJCAKLQELEGAVKSKFKATISALEAKIGQLBEQIjE 
Q EAKERAAANKLVRRTEKKIiKEI FMQVEDERRHADQYKEQM B KA 
RARMKQLKROLEEAEEEATRTVNASRRKLQRELDDATEANEGT^SR 
KVSTLKNRLRRGGP ISFS SSRS GRRQLHIjEGASIjEIjSDDDTES K 
TSCJVNBTQPPQSB 


5931 


113 


6082 


RGNCFWIVPFrMAORTGLEDPERYLFVDRAVIYNPATQADWTAK 
KLVW1 PSBRHGFEAASIKBERGDEVMVEIAEMGKKAMVNKDD iq 
KMNPPKFSKVBDMAELTCLNEASVLK^^ 
CWINPYKNI^IYSEWIIEMYRGKKRHBMPPHIYAI3E9AYRCM 
LQDREDQS ILCTGEEGAGKTENTKKVIQYLAHVAS SHKGRKDHN 
IPGB\LBRQLIjQANPIIiESPGNARTVQNDNSSRFGKFIRINFDV 
TGY I VGAN IETYIJUSK3 RAVRQAKDERTFH I FYQH>SG\AGEHL 
KSDQXEQFNNYRFLSWGYIPI PGQ\QDKGNFRGDPGEAMHIMG 
FSKEBXLSMLKWSSVI^FGNIS FKKERNTIXJASMPBimmQKE 
CHLLGMNVMEFTRAILTPRIKVGRi)^ 

KATYKRLERWLVHRIMKAl^RTKRQGASFIGILDIAGFEIFELN 

SFEQI^IKYTNEKlWLFTQHTMFIt^BBYQRBGIEWNFIDPGb 

DLQPCIDLIERPANPPGVIJUjIiDBECWFPKATDKTFVEKLVQEQ 

GSHSKFQKPRQUCDKADFCIIHYAGKVDYTM^ 

NVATnLHQSSDRFVABLWKDVDRIVGIJ^VTGMTET 

KKGMFRTVGQLYKESLTKLMATLRim^FVKCI I PNHEKRAGK 

LDPHLVIiDQIiRCNGVLEG IRI CRQG FPNRI VFQBFRQR YE ILTP 

NAI PKGFMDGKQACBRM I RALBUDPNLYR IGQSKI PFRAGVLAH 

LEEERDIJKI TDI 1 1 FFQ AVCRGYlJulKAFAK^QQLS AIjKVLQR 

NOUWY^KIiIUIWQWWRVFTKVKPLLQVTRQBEBLQAK^ 

EKQTKVEGBLBEMKRKIlQaiiLREKNI LAEQLQAETEXFAEABEM 

RARLAAKKQELEEILHDLBSRVBBEEERNOIljCfN^ 

DLEEQIjDEBEGARQKIjQIiEKVTAEAKI KKMEEE I LLLBDQNE KP 

IKHKKW4EDR IAECSSQLAEEEEKAKNLAKIRNKQEVMISDLEE 

RXjKKEEOT?QELEKAKRK1jIX5ETTDI^IX3IAKLC^QIDELKLQL 

AKKEF^LQGALARGDDBTLHKimAUCVVPvEI^ 

KKASRNKAEKQKRDT^EELEAIiKTFJjBTyJTiDTrAAQOBERTKRE 

QBVAKl.KKALEEETKmiEAQIQDMRQRHATALEEI>SEQI»EQAKR 

FKANLBKNKQGLETDNKEIiACEVKVTjQQVEAESEHKR KKLDAQV 

QELHAKVSEGDPJjRVBIiABKASKLQNEliDNVSTIJ^EK 

FAKDAASLKSQLQDTQBIiLQEEl'HQKLKrLSSR IRQLBEEKNSLQ 

EQQEEE^BARKNIiEKQVIiALQSaLA^ 



404 



WO 01/53312 



PCT/USOO/34263 



SBQ 
ID 
NO: 


Predicted . 
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nucleotide 
location 
corresponding 
to first 
amino acid 
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amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
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residue of 
amino acid 
sequence j 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, E» 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=HiBtidine, I=»Isoleucine, K=Lysine, 
L«Leucine, M^Mcthionine, N~Asparagine , 
P«Proline, Q=Glutamine, R^Arginine, 
S^Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=Unknown, *-Stop 
Codon. /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KKKLLKDAEAI^QRLEBKAT AYDKl^KTKNRLQQE LDDLTVDLD 
HQRQVASNLEKKQ\RKET)QLIiAEEKSISARYAEERDRAEAEARE 
KETKAIjS IiARALEEALEAKEBFERQj^QLRAIWiEDLMSS RDDVG 

knvhelbkskraleqqv\eemrtqi^ledblqatbi>akl^ev 

NMQAMKAQFERUIjQTRDRONKBKKRLIiIKQVRBTjRAKLEDKRKQ 

I^IAVASKKKHBIDLKDLEAQIKAANKAPJDEVTKQ^ 

DYQRB LEBARASRDK I FAQSKES EKKLKSLEAEILQLQEELAS S 

ERARRllABQERDEIJa)EITNSASGKSAIJJ3EXRRLEARIAQLRR 

BI^EEBQSNMELLNDRFIUCrT^Vim^^ 

QQLERQNKKLKAKLQELEGAVKS KFKATI SAI*EAKlGQIiBEQLE 
QEAKBRAAANKIjVRRTEKKLKEI fmqvedjerrhadqykeqmkka 
KARMKQLKRQLEEAEEEATRANASRRKLQRKLDDATKANKGLiSR 
EVSTLKNRrJRRGGPISFSSSRSGRRQLHjUEGASLPXSPDDTESK 
TSDVNETQPPQSE 


5932 


33 


S72 


RHLBEICFLFI^KBRKIJCLSGPRWKEGKPRGTGGLWVKABANMG 
FGATIAVGLTIFWiSVVTIIICFTCSCCCLYKTCRRPR 
PHPP/PVVHAPYPQPPSVPPSYPGPSYQGYHTMPPQPGMPAAPy 
PMQYPPPYPAQPMGnPPAYHF/njAGGAAAPYPASQPPYNPAYMDA 
PKAAL | 


5933 


1 


3190 


GTRKiaOIADKTPGGSQKASSKTHSSDVHSSGSSDAHMDASGPSD 

SDMPSRTRPKSPRKHNYRNESARESLCDSPHQNLSRPLLENKLK 

APSIGKMSTAKRTI^KKEQBKI^CKBDBKAAAKIYEEPIJ^ 

SPX3NKVKTFVRGGVVNAAKEBHETDEKRGKIYKP3SRFADQKNP 

PNOSSNEKPPSLLVIBTKKPPLKKGEKBKKKSNLBLFKEELKQI 

QBBRDBRHKTKGRI^FEPPQSDSDGQRRSMDAPSRRNRSSGVIi 

DDYAPGSHDVGDPSTT\NFYU=NI \NP0t1NLKJCCCCGEFGRFGP 

lASVKXMWPRTDEERARERNCGFVAFMNRRnAERALKNIiN 

MSFEMKLGWGKAVPIPPHPIYTPPSMMBJITLPPPPSGLPFNAQP 

RKRI«KNPNAPMLPPPKNKEDFEKTI^C^lVKWIPTERNLIJUiI 

IIRMIEFVVREGPMPEAMIMN^INNPMFRFLFENO^PAHNTYYRW 

KLYS I LQGDS PTKWRTBDFRMPKNGS FWRPPPLNP YLHGMSEEQ 

ETEAFVEEPSKKGALKBEQRDKLKEILRGXiTPRKNDIGDAMVFC 

LNNAEAAKEIVDCITESI>SILKTPLPKKIARLYIiVBDVLYNSSA 

KVANA5YYRKFFRTKLCQI FSDLNATYRTIQGHLQSENFKQRVM 

TCF1^TODWAIYPEPFLIKLQNIPTX?LVNIIKEKETKDVPDDIJ> 

GAPIEEELDGAPLEDVDGI PIDATPIDHUXiVP I KSLDDDLDGV 

PLDATEDS KKNEPI FKVAPSKWEAVDBSELEAQAVTTSKHELFD 

QHBESEEE F3JQH QEEESBDEEDTQSS KSRBHELYSNPI KBEMTB 

£ KFS KYSEMSEE JGUOa.RBI BI*KVMKFQPELESGKRPKKPGQSF 

QE0VBHYRDKIiLQREKEKELERERERDKKDKEKLE3RSKDKKEK 

DECTPTRKERKRRHSTSPSPSRS3SGRRVKSPSPKSERSBRSER 

S HKE S SRSRS SHKDS PRDVS KKAKRS PSGSRT PKRSRRSRSRS P 

KKSGKKS RSQSRS PHRSHKKS K331CnSTGRKP FKKAVTYWKCDLF 

LCPERSVF 


5934 


1 


3190 


GTRIO^KMADKTPG^SQKASSKTRSSDVHSSGSSDAHMDASGPSD 

SDMPSRTRPKSPRKHNYRNESARESLCDSPHQNIjSRPLJjEKKLK 

AFS IGKMSTAKRTJiS KFvEQEELKK^DE KAAAE I YEBPIiAAFEG 

SDGNKVKTPW3GVVNAAKBEHBTOBKRGKIYKPSSRFADQKNP 

PNOSSNERPPSIiVIETKKPPIiKlCGEKEKKKSNLELFKEELKQI 

QSERDBRHKTKGRI*SRFBPPQSDSTX5QRRSMDAPSRRNRSSGVL 

DDYA^GSHDVGDPSTT\NFYI/^I \NPQMNIJCKCCCQBFGRFGP 

LASVXIMWPRTDEERARERNO^FVAFMNRRDAEPJ^KNL^KMI 

MSP^KIiGWGKAVPIPPHPIYTPPSMMBHTLPPPPSGI.PFNAQP 

RERLKNPKAPMLPPPKNKEDFEKTI^OAIVKVVI PTERNIjLALI 

HRMIBFVVREGPMFBAMIMNHEINNPMFI^LFKtra 

KLYS ILQGDSPTKWRTEDFRM FKNGS FWRP P PLNP YLHGMSKEQ 

BTEAFVEEPSKJKGALKEEQRDKIiEEILRGLTPRKNDIGDAMVFC 

LNNARAAEE I VDC1TESLSI LKTPLPKKI ARLYLVSDVLYNESA 

KVANASYYRKFFETKIiCQIFSDLNATO 

TCFRAWEDWAI YPB PFL IKLQNI FLGLVNI I EEKBTED VPDDLD 
GAPIEEBl*DGAPIiBDVlX^PIDATPlDDlJ)GVPIKSIjDDDriDGV 
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ID 

| NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide \ 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine , c-Cysteine, Dispart ic Acid, E= 
Glutamic Acid, F=Phenylalanine, G»Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L-Leucine, M-Methionine, N^Asparagine, 
P=Proline, Q^Glutamine, R-Arginine, 
S-Serine , T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PliDAIBDSKKNBP 1FKVAPSKWEAVDBSELEAQAVTTS KWELFD 
QHEESKKEEMQNQBEESEDBEDTQSSKSEBHHtiYSNPIKEBMTE 
SKF6KYSEMSEEKRAKLRB1BI>KVMKPQDBLBSGKRPKKPGQSF 
QEQVK]YRDKLLQRBKEKELHRKRSRDKKDKEKX£SRSKDKKEK 
DECTPTRKBRKRRHSTSPSPSRSSSGRRVKSPSPKSERSBRSER 
SHKESSRSRSSHKDSPRDVSKKAKRSPSGSRTPKRSHRSRSRSP 
KKSGKKSRSQS RSPHRSHKKS HGKTNTGHKFFKKAVTYWKCDI#P 
LCPERSVP 


5935 


3 


4493 

• 


3 Y WI^GWRLSRPPRQFWAGHRGIGRPGTMAPVHGDDCE IGASAL 

SDSG3PVSSRARRBKKSKKGRQEALERIJKKAKAGKRYKYKVEDF 

TGVYEBVDEEQYSKLVQARQDDDHI VDDDGIGYVEDGREI FDDD 

IiEDDAUDADEKGKDGKARNKDKRinna^^ 

GKKTADKAVDLS KDGLLGDILQDI JJTETPQI T P PPVMI LKKKRS 

IGAS PNPFS VHTATAVPSGKIAS PVS RKE PPLTP VPLKRAE PAG 

DDVQVESTEEEQESGAMEFEDGDFDEPMBVEEVDLEPMAAKAWD 

KESEPAEEVKQSADSGKGTVSYLGSFIJPDVSCWDIDQEGDSSFS 

VQEVQVDSSHLPLVKGADE EQVFHFYWIiDAYEDQYNQPGVVFIiF 

GKWIESAETHVSCCVMVTCNIERTLYFI^REMKrDLNT^ 

EISMKimEEPDEKIATKYKIMKFKSKPVEKNYAFEIPiyVPEECS 

EYLEVKYS^MPQI^QDUCGETFSHVFGTOTSSIiEIiFLMN 

GPC^IaEVKKSTAIiNQPVSWCrKVEAl^LKPDLVNVIKDVSPPPLV 

VMAFSMKTMQNAKNHQNE I IAMAALVHHS FALDKAAPKP PFQSH 


FCWSKPKDCI FPYAFKBVIEXKNVTCVEWAATEllTIJtXSFTIiAKV 

HKIDPDI IVGHNIYGFELEVLLQRINVCKAPHWSK1GRLKRSNM 

PKLGGRSGFGERNATOJRMICDVEISAKELIRCKSYHLSELVQQ 

ILKTKRVVIPMFMIONMYSESSQLLYLIjEHTWKDA\KFIIiQIMC 

BMTVIiPLALQITNIAGNIMSRTIjMGGRSE^ 

IVPDKQIFRKPOXJKLGDBDEEIDGDTNKYKKGRKKGAYAGGLVL 

DPKVGFYDKFI IiIiLDFNSLYPSI IQBFNICFTTVQRVASEAQKV 

TBDGEQ3QIPELPDPSI»EMGI LPRBIRKLVERRKQVKQUIKQQD 

UJPDLII^YDIRQKALKLTANSMYGKn^FSYSRFYAKPLAALVT 

YKGREILMUTKEMVQKP1NI>EVIYGDTDSIM INTNSTNIiEBVFKL 

GNKVKSE\WKl,YKXiEIDIIX^KSI»L:u^ 

GNYVTKQELKGI^IVRRDWCDIjZUa>TGNFVIGQI 

BWIQKRLIBIGENVL^SVPVSQFEIHKALTKDPQDYPDKKSLP 

HVHVALW insqggrkvkagdtvs YVI CQDGSNLTASQRAYAPEQ 

LQKQDNLTIDTQYTbAQQIHPWARlCEPIDGIDAVLIATGWBL 

\DPTQFKVHHYHKDBF^DALI/^PAQLTDEBKYPJX!BRFKCPCP 

TCGTENI YDNVFDGSGTDMB PSI*YRCSNI DCKAS PLTFTVQLSN 

KIJ3ffiIPJiFIKKYYDGWLICEEPTamRTRHLPI^FSRT^ 

ACMKATKiPKYSDKSLYTXJLCFYRYIFDAECMiBKliTTO 

LKKQFFTPKVIjQDYRKIiKNTAEQFLSRSGYSEVNIiS KLFAGCAV 

K3 


5936 


1124 


139 


RGEEQFDAEF^FACLGFGERIiQEFSRl^LRAVm^RAWTCYLAI 
RMLMATCCPSPTTTACTG P WQRAPPLRWjVQKRBADS SGLAFAS 
NSI^RRKKGIXI^PVAPLRTRPPIJilSLPQDFRQVSSVIDVDLl. 
PETHRRVRLHKHGSDRPLGFYXRDGMS VR VAPQG\ LERVPG I PX 
SRlAfl*GGIiAESTGLLAVSDBII#EV^ 

SH^\LrVTVKPANQRNNVVRGASGRLTGPPSAGPGPAEPDSDDD 
SSDLVIElTOQPPSSIJGI^QGPPCWDIiHPGCRHPGTRSSLPSIiDD 
QEQASSGWGSRIRGDGSGFSL 


5937 


31 


1600 


PTSLI*KSTVQIJtfCRLLQDKIiYQ£VYSLAEIF^ 
YGLTSSYSLWWMLRSSUCQYS FEALRBKSNYSDI PDVKNDFAFI 
LHLRDQYDPLYSKRFSI ET*SF^SENKLK£INLSNBWTVEKLKSK 
LVKlWDKlELHLFMI^GIiPDNVFBLTBMEVLSriEIil PEVKLPS 
AVSQIiVNIjOLRVYBS9LVVDHPAIAFI£EHI»KI1^^ 
IPRWVFHLKNLKELYLSGCVLPEQI^TMQIiBGFQDIiKNLRTLYL 
K5SLSRI PQWTDLLPSIiQKLSIiDNEGS KLVVLNl^KKWWLKS 
LELXSCDLERIPHSlFSr^I^I^LRBiniLJCTVEBIISEX}^ 
NLSCLKLWHNNI AYI PAQIGAI*SN1»EQLS1JDIB^IENLPLQLFL 
CTKLHYLDLSYNHLTFI PEElQYli\SNI»QYPAVTNNNI RMLPDG 
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Predicted 
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location 
cone spondi ng 
to first 
amino acid 
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amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^ Alanine, C=Cysteine, D=>Aspartic Acid, E=* 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
B=HistidIne, I=l3oleucine, K^Lysine, 
L ^Leucine , M=Methianine, N^Asparagine , 
P^Proline, Q^Glut amine, R=Arginine, 
S=Serine, T-Threonine, VeValine, 
VI -Tryptophan, Y=Tyrosine, X -Unknown, *=Stop 
Codon, /=»possible nucleotide deletion, 
\=possible nucleotide insertion) 








LFQCKMiQO^IiLGKNSLMNLS PHVGELSNLTHREP IG \N YLETL 
PPBU^CQSLKIWCLIVEENLLMTLPLPVTERLQTCIiDKC 


5938 


395 


1865 


YKGEGFFCNQEARGEPJlKKKKAMSSPN IWSTGSSVYSTPVFSQK 
FmmiLIiI»I>SLYPGFTSQKSDDDYEDYASKrKTWVLTPKVPEGDV 
WIIiNNLLEGYDjS^LRPDIGVKPTLIHTDMYVNS IGPVNAINME 
YTIl}IFFAQTWYDRRLK3rNSTIKVL 

S KKADAHW I TT PNRMLRI WNDGR VL YSIiR LT IDA RCQliQIiHN P P 
MDEHSCPLEFSSYGYPREBIVYQWKRSSVEVGDTRSWRLYQFSF 
VGLRNTTBVVKTTSGDYVVMSVy FDLSRRMGYFTIQT^ PCTLI 
VVI^WVSFWINKnAVPARTSLGITTVLTrm^LSTIARKSLPKVS 
YVTAMDLFVSVCFI FVFSALVBYG\TLHYFVSNRKPSKDKDKKK 
KNPAPTIDIRPRSAT IQMNNATHLQERDBBYGYECLDGKDCASF 
FCCFBDCRTGAWRHGRIHI RIAKMDS YARI FFPTAFCLFNLVYW 
VSYIiYL 


5939 


66 


1404 


IRPGYLKBVQKNSPGHRAGLBPFFDFI VS INGSRLWKDNDTLKD 
LLKANVKKPVKMLI YS S KTIjBT*RETS VTPSRLWGGOGIJjGVS IR 
FCSFDGANBNVWHVLBVBSHSPAAIJ^LRPHSDYIZGADTVMNB 
SBDLFSLIBTHKAKPLKLYVWrDTDNCRByi ITPNSAWGGEGS 
LGCGIGYGYLURI PTRPFEEQKKISIiPGQMAGTPITPLKDGFTE 
VQI^SVNPPS)^PPGTTGIEQSIjTGI*SISSTP\PAVSSVIiSTGV 
PTVP\LIJPQYNQSLT5VPPMESSYI»HLPGLMPFTRQGLPH^ 
PSTFWLPRNPTHSWPGVGliYQEFVKPGVIiPPLSSMPPRNtPGVl 
APLPLPSBFLPSFPLVPBSSSAASSGELLSSLPPTSNAPSDPAT 
TTAKADAAS KLTVDVTP PTAKAPTTVEDRVGDS TP VSE KPVSAA 
VDANASESP 


5940 


145 


m 


rrsasrsasprqsagtavrtgtoaggtciiaaahh^^wradgr^ 
lbklpvhmgiiviteveqepsfsdiaslvvwcmavgisyisvydh 
qg i fkrnns rlmdr 1 1 »k0qq b r ilgijxs kys prfansndkddqv 
lwchlavkviis peix3kad i vraaqdfc^lvaqkqkrptdldvdt 
laWylvqmwlili 


5941 


13 


6147 


KCIXSRMGASSPP^PEPVGPPAPGLPFCCGGSIJ^VVVLLALPVA 
WGQCNA?RW\LPFARPTNLTDEFBFPlGTYIiNYECRPGYSGRPF 
SIICLIOISVWTG0UaRCRPJCSa^ 

IKYSCTKGYRLIGSSSATCIISGrnvIWmRTPICDRIPOGLPP 

CTSNDDQVGIWSGPAPCCI I PNKCTPPNVENGILVSDNRSJjFSIj 
NFA/VEPRCQrcF^VMKGPRRVKCX3ALNKWBPKI*PS CS R VCQPPPD 
VLHAERTQRDKDNF3 PGQBVFYSCEPGYDUM3AASMRCTPQGDW 
SPAAPTCEVKSC^DFMG^IJ^GRVI.FPWLQLGAKVDFVCnDBGF 
QLKGSSAS YC^/IAGMESLWNSS VPVCEQI FCPS PPVI PNGRHTG 
KPI^VFPFGKAVNYTCDPHPDRGTSroLIGESTIRCTSDPQGNG 
VWSSPAPRCGIIiGHCQAPDHFLPAKlJCTO^raASDFPiaTSLKYE 
CRPE YYGRPFS I TCLDNLVWS 3 PKDVCKRJKSCKTPPDPVNGMVH 
VITDIQVGSRINYSCTTGHRLIGHSSAECILSGWAAHWSTKPPI 
CQRIPTCLPPTIANGDFISTNRENFHYGSVVTYRCNPGSGGRKV 
FELVGHPS I YCTSNDDQVGIHSGPAPQCI rPNKCTPPNVENGIL 
VSIWRSLFSLNBVVRFRCQPGFVMKGPPJIVKCQAjUnCWEPBLPS 
CSRVCQPPPDVIJHAERTQRDKDNPSPGQKVFYSCBPGYDLRGAA 
SHRCTPQGDWSPAAPTCEVKSCPDFWQLLNGRVLFPVNLQIX3A 
KVDFVCDEGFQLKGSSASYCVIiAGMESLWNSSVPVCEQIFCPSP 
PVIPNGRHTGKFLBVFPFGKAVNYTCDPHPDRGTSFDLIGESTI 
RCTSDPQGNGVWSSPAPRCXSIIjGHOQAPDHFLPAKLKTQTMASD 

fpxgtslkyechpbyygp^fsitcxdni.vwsspkdvckrksckt 
ppdpvngmvh\h:tdiqvgsrii^sci^ghrlighssaecilsgn 
tahwstkppioqri pcglppti angdfistnr3nfhygs wtyr 
cnlgsrgrkvfelvgepsiyctshddqvg iwsgpapqci ipnkc 
tppnvfltoilvsdl^slfst^bvvbpr^pgpvmkgprrvkcoa 
i^tkwepblpscsrvcqpppeilhgbhtpshqiwfspgqevfysc 
epgydi^gaasljictpcx3dwspfaprcavkscddfixxaiphgrv 

LFPLt^IiGAKVSFVCDEGFR^KGSS\reHCVr,VGMRSLWNNSVP | 
VCEHIFCPNPPAILNGPJTK5TP^DI PYGKKISYTODPlIPDRGM | 
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SEQ 
XD 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cyeteine, D=Aspartic Acid, B=» 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H*=Histldine, I^Isoleucine, K=Lysine, 
Ii=Leucine, M=Methionine, N=Asparagine, 
P-Proline, Q=Glut amine, R^Arginine, 
S= Serine, T=Threonine, V=Valine, 
W-Tryptophan, Y«Tyrosine, X^Unknovra, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








Tfc'NIiIGBSTI RCTS DPHGNGVWSS PAPRCKI^ VRAGHCKTPEQ F 
PFASPTIPINDFKFPVGTSLNYBCRPGYFGKMPS ISCLENLVHS 
SVEDNCMK^CGPPPKPFNGMVHIim>TQFCSTVNYSCNEGFRL 
IGSPSTTCLVSGNNVTWDKKAPICEI1SCBPPPTISNGDFYSNN 
RTSPHtK5TVVTYOCHTGPDGEQLFBIiVGERS I YCTS KDDQVGVM 
SSPPPRCISTNKCTAPBVKNAIRVPGNRSFPSI,TBI IRFRCOPG 
FWVGSHTVQCQTNGRWGPKLPHCSRVCQPPPEXrjHGKHTLSHQ 
tWSPGQjBVPYSCEPSYDLRGAASI^CrrPQ^ 
CDDFl^l^HGRVIiPMII^LGAKre 

LAGMKALWNS S VP VCEQI FCPNPPAILNGRHTGT PLGDI PYGKK 

VSYTCDPHPDRGMTFNLIGBSTIRRTSBPHGNGVWSS PAPRCEI* 

PVGAACPHPPKIQNSHYIGGHVSIiYLPGMTISYTCDPGYLLVGK 

GFIFCTO»IWSQLDHYCKF^CSFPLFim5^^ 

GDYVTLKCEDGYTLEGSPWSQCOTDDRWDPPI^ 

VGTLSGTIFFILLIIFLSWIILKHRKGNNAHENPKEVATHIiHSQ 

GGSSVHPRTLQTNEENSRVltP 


5942 


4509 


688 

i 


YLYWMRANPLAYGISHKAYQIDPPL\RKHRKQ\L»VIE\VGRKL 
DK\AQMIRPBERTGYFSSTDLGRmBHYYlKYWTIETFNELFDA 
HKTEGDI FAI VS KAEEPDQI KVREEEI EELDTLLSNFCEIiS TPG 
GVENSYGKINILI^TYINRGEMDSFSLISDSAYVAQNAARIVRA 
LFEIAIJIKRWPTMTYRIjLNI*S KAIDKRLHGWASPIiRQFS I LP PH 
MLTRLEBKKLTTOKLICDm^ 

SVMMEAFIQP I TRTVLRVIXS I YADFTNNDQVHGTVGE PWWI WV 
EDPTNDHIYHSEYFLALIGCQVISK^QI»IjVFTIPIFEPI*PSQYY 
IRAVSDRWDGAEAVCI INFQHLILPERHPPHTEIJaDLQPLPITA 
LGCKAYEALYNFSHPNPVQTQI PTm.YHTDCNVLLGAPTGSGKT 
VAAELAI FRVFNKYPTSKAVYIAPLKALVRERMDDWKVRI EEKL 
GKK^IELTGDVTPDMKSIAKADLIVTTPEKWEX3VSR3WQNRNYV 
QQVTIIiI IDEIIILl^EERGPVLEVIVSR'mFISSHTEKPVRIVG 
LSTAIANARDLADWItfIKQ*TC^ 

HYCPRMASMNKPAFQAIRSHSPAKPVLIFVSSRRQTRLTALELI 
A?LATEEDPKQfWI»NMDERBMBNI IATVRDSNLKLTIAFGIGMHH 
AGI^ERDRKTVEmPVNCKVQV^IATSTlAWGVNFPAHLVI I KG 
TEYYIXSKTRRYVDFPITDVIiQMMGPJVG^ 

KKDFYKKFIiYBPFPVESSIjIX?VI>SDHIiNAEIAGGTIT3KQDAL»D 
YlTVnYFTRRI#IMNPSYYNLGDVSHDSVNKETjSHI*IEKSLIELE 
LSYCXBIGBDNRS I BPLTYGRIAS YYYLKHQTVKMFKDRIiKPEC 
STSELI^ILSDAEEYTDI^PVRHNED^IMNSEI^KCLPIESNPILSP 
I^PHTKAHLLL<ytfflji5RAMLPCT 

DVAANQGWLVTVLNITNLTQMVTQGRWLKDSSLLTLPN I ENIIHL 
HLFKKWKPIMKGPHARGRTSIBCLPELIHACGGKDHVFSSMVBS 
ELHAAKTKQAWNFIiSHLPBINVGISVKGSWDDLVEGHNELSVST 
L7ADKRDDNKWI KlJIADQEYVLQVSLQRVHFGFHKGBGPESCAVT 
PRFPR^KDEGWFLILGEVDKilBLIAIJCRVGYIRNHHVASriSFYT 
PEI PGRYI YTLYFMS DCYDGLDQQYD/NLSQRYTS ES FCTGQHQ 
GIi 


5943 


1 


2274 


DKPTRHKTYLSSSWA^OIAAAF^PVGDGFJLWQT^^ 

FGLKNQS PTKAEKPASSSI^SSPPPQIiTRNVVFGLGGKLPLWD 

GEDSSFI,VVRLRGPSGGG\EEPALSQYQRLLCINPPLFEIYQVL 

LSPTQHirVALIGIKGLMVIiEIiPKRWGKNSEFPXSGK^ 

VABRFPTSSTSLTLKHAAWYPSEILDPHVVLLTSDNVIRIYSLR 

EPQTPTNVI II^BABEF^LVLimSRAYTASLGBTAVAFDFGPLA 

AVPKTLFGQNGKDEWAYPLYI LYENG ETFLTY I S IXHS PGN/ 1 

WKAVGS IA1 IAS \AAEDNYGYT^CAVLCIiPCVPNI LVIATESGML 

YHCVVI^BEKDDHTSEKSWDSRIDLIPSLYVreCVBI^LAI^L 

ASGEDDPFDSDPSCPVKLHRDPKCPSRYHCTHEAGVHSVGLTWX 

HKLHKFLGSDEEDKDSLQEEiSTBQICCFVEHILCTKPIjPCRQPAP 

IRGPWIVPDI^PTMIClTSTYBCIilWPIJ^^ 

EDVEVAES PLRVLABTPDS FEKHIRS I LQRS VANPAFIiKAS EKD 

IAPPPEECl^LSRATQVFREQYITJCQDU^EXQRRVKLIiCTO 

KKKQuBI>LSYCKEBIlKSLR^MABPJ!«ADKYB 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleot ide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, ocysteine, D=Aspartic Acid, E=* 
Glutamic Acid, F- Phenylalanine, G=Glycine, 
H^Histidine, I^Isoleucine, KoLysine, 
L=Leucine, M=Methionine, tt^Asparagine, 
P=Proline, Q=Glutamine, R=>Arginine, 
SaSer ine, T=Threonine, V- Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








KLLHSFHSBLPVLSDSERDMKKELQLI PDQLRHLGNAI KQVTMK 
KDYQQQKMEKVLSLPKPTI I IiS AYQRKCIQS I LKEEGEH I RE MV 
KQINDIRNHVNF 


5944 


167 


342B 


PS lATFTDBPBVLTEPPSATTTTTIGISAmrTIJ^SHGKRNNT 
ITTTSS KRKNRKNKITPENVQI IFDDPLPIS YSQPEKVNGES KS 
SSTSESGDSDNMRISSCSDESS^SNSSRIffiDNHSPAVVTTTVSS 
KKQPSVLVTPPKEERKSVSGKASIKLSETISBGTSNSLSTCTKS 

gps piisspngkltvaspkrgqio^egwkevvrrskkvsvpstvi 
srvigrggcn i nai reftgah ididkqkdktgdr i itirggtb3 
trqatqlinaiilkdpdkbidelipknrlksssanskigssaptt 
taant3 lmgi km ttvals sts q1ataltvp ai s sasthktiknp 
vn\hvrpgftvsfp\latoppqfahaij»aaqtpqqirppri*pmt 
hfggtfppaqstwgpppvrpi^paratkspkphmvprhsmqwss 

GSOVNSAGSLTSSPTTTTSSSASTVPGTSTNGSPSSPSVRRQLF 
VTVVKTSNATTTTVTTTASNHl^^ 

SSPSPPAQPGGVSRNSPLIXX5SASPNKVASSSBQEAGSPPVVET 
TNTRPPNSSSSSGSSSAHSNQQQPPGSVSQBPRPPIiQQSQfVPPP 

P17DMTUDDT H»PtJ<S kIX\/»VDBTOVT>iriVPMDfYPPMfW^OP'TPKMRT 

PAIRPPPHGTTAPHKN3ASVQNS3VAVLSVNHIKRPHSVPSSVQ 
LPSTLSTQ3ACQNSVHPANKP lAPNFSAPLPFGPFSTIiFENSPT 
SAHAPWGGSVVSSQSTPESMLSGKSSYIiPNSDPI*HQSDTSKAPG 
FRPPLQRPAPSPSGI VNMDS PYGSVTPSSTHLGNFASNISGGQM 
YGPGAPLGGAPAAANFNRQHFSPLSLLTPCSSASNDSSAQSVSS 
GVRAPSPAPSSVPLGSEKPSNVSQDRKVPVPIGTERSARIRQTG 
TSAPSVIGSNLSTSVGHSGIWSFEGIGGNQDKVDWCNPGMGNPM 
IHRPMSDPGVFSQHO^MERDSTGIVTPSGTFHQHVPAGYMDFPK 
VGGMPFS VYGNAM I PPVAP IPDGAGGP I FNGPHAADPS WNSLI K 


5945 


1461 


197 


GVTHLFLFGKRKLRNGIAEDLKGQADFFFIiLVSEAVVATGSPRA 

WLTCXILPLPGIIPSVLPKAMSRPIJLITFTPAIT)PSDLWKX)GQQ 

QPQPEKPESTLOGAAARAFYBAliIGDESSAPDSQRSQTEPARBR 

KRKKRRIMKAPAAEAVAKGAEGRHGQGPvSLEABBKMTHRIIiRAA 

QEGDI*PBI»RRLI*EPHEAJGGAGGNINARDAFI3WTPI^ 

AAVSYIjIXIRGAAWVGVCEIjSGPJ)AAQIjAEKAGFP 

GETRSPENRSPTPSLQYCENCDTHFQDSNHRTSTAKIJ^SLSQGP 

QPPNLPIiGWISSPGFKLLIiRGGirePGMGI^PRGEGPJVNPIPW 

IJaUX}EGLGYRSAPQPRVTHFPAWI>TRAVAGRB \TP PRVATLSW 

REERRREE \KDRAWERDIiRTYMNLEF 


5946 


541 




ILGSYSSIQPEEYS \SWC\EVVLQDLLA\YVSP1C\HSYXRDLP 
SEGS PQRVNS IDFV\ PX\ EHLQPDVLVHAVLRVVDF / T I LTEAV 
YSYRGQKQKK\^TVEO^QDQHYALVLWGPGAAW\YPQIiQRKKG 
YIWEFKYLFVQCNYTLE1ILBLHTTPWSSCBCXFDDD1RAITFTO 
KFQKSAPS FVKI SDLATHIiBDKCSGWL IRAQ I SBLAFP I TASQ 
KIALNAHSSIiKS IFSSLPNIVYTGCAKCGLEI.ETDENRI YKQCF 
SCLPFTMKKIYYRPAIMTAIIXJRHDVCIRVF^KLIEKILLNISA 
DCL^^^VIVPSSRITYG^^AmDLFHSLIJ\VSAEPCVIJCIOSLFVL 
DKNSYPLQQDFSLLDFYPDIVKHGANARL 


5947 


3 


1317 


rgi pdrrrrgp igrvmdlenkvkkmglgheqgfgapclkckek 

cegfei^fwrkxo^cXnvakksm/tviasnkbdrkvgklfsdt 

kytxi»iaio>ksix3ipmylwnvmiltnpvm 

ppvqnqaijvrqymqmlp1u5kqpvagsrgaqyrkkqlakqlpahd 

qdpskcwelsprevkrmbqfvkkyxsealgvgdvkiipcemdaqg 

pkqiwipggdrstpaavgamedksaehkrtqysofccklsmkeg 

dpaiyaeragyl)klwhtacfv(^tchbiivdmiyfwknbkly^ 

rhycdsek^rc3vggdei»ifsney^abnqwhhij^ i 

lage i yvmvndkpvckpcywcnhav^ 

nfswhastecflcsc^kcligqkfmpvegmvfcsveckkrms 


5948 


39 


3370 


YRERYPVSGGSVLRSALEVCITOF1*SGI#TEGSLLPBGFFSGPIDQ 
GNHYQMRRK^CHRGSAARHPSSPCSVKHSPTOETLTYAQAQRM 
VE IE1EGRLHRIS I FDPLE I ILEDDLTAQEMSECNS3IKENSERP 
PVCLRTIO^HKKireVKKKNEALPSAHGTPASASAI^ 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A Alanine, C=Cysteine, D=*Aspartic Acid, K« 
Glutamic Acid, F=Phenyl alanine, G "Glycine, 
H~Histidine, I=*Isoleucine, KaLysine, 
L=Leucine, M=Methionine, N-Asparag ine , 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=>Tryptophon, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








S PPS APRRP P VYYKF I EKS AE EIiDNBVE YDMDEED YAWbE IVNB 
KRKGDCVPAVS QSM FK FLMDRFBKESHCENQKQGEQQS L IDED A 
VCCI CMDGECQNSNVXLPCI>MCNIAVHQBCYGVWI PEGQWLC/ 
RAHCLQSRARPAIXrVLCPNKGGAFKKTDDDR>«5W\ VCALW\ I P 
K\VGPA^mrFIBPIIXJVI^PPARWKLT^a^rX3CEKGR/VGACI 
QCHKANCTTAFHVTCAQKAGLYMKMEP^ 

YCDVHTPPGCTRRPLNIYGIJ^MKWGVCRKESSVKTVRSTSKVR 

KKAKKAKKALAB PCAVIjPTVCAP YIP PQRI.NR IANQVAI QRKKQ 

FVEI^SYWIJ.K^LSPJ^PLI.RRI.QSSLQSQRSSQQRENDKHM 

KAAKEKLKYWQRIiRHDljKRAMiIilEIiI^KRBKL 

MBLRLTPLTVLIiRS VLDQLQDKDPARIFAQPVS IiKEVPD YI»DH I 

KHP*^FATTffiF^T j?i^yyYKNT fiBFRKI *FlJl>I IDNCMKYNARDTV 

FYRAAVRLRDQGGVVLRQARREVDSIGIiKI^G>im,PERPA 

RRPFSWEDVDRLLDPANRAHIX3LEEQLRELLDMLDLTCAMKSSG 

SRSKRAKIiLKKEIALLRNKLSQOIISQPLPTGPGLEGFKEDGAAI, 

GPKAGEEVLPRI*ETIXOPRKRSRSTCXJDSBVBBBSPGiaUjDAGL 

TNGFGGARSBQBPGGGLGRKATPRRRCASESSrSSSNSPLCDSS 

FNAPKCGRGKPALVRRHTLKDRSELISCIBNGNYAKAARIAAEV 

GQSSMWISTDAAASVI^PIiKVVWAKCSGYPSYPALI IDPKMPRV 

PGHHNGVTIPAPPIjD VLKlGEHMQTKSDEKIiFIiVLFFDN KRS WQ 

WLPKSKMVPLGIDETIDKIJaiMEXSPJJSSIRKAVRIAFDRAMNHL 

SRVHGEPTSDLSDID 


S949 


39 


3370 


YRRRYPVSGGSVLRSALBVCWDFLSGLTRGSLLPEGFFSGPIDQ 

GfimYQMRRKGRC^GSAARHPSSPCSVXHSPTRKTLTYAQAQRM 

VEIBIEGRLHRISIFDPLBIILBDDLTAQEMSECNSNKBNSERP 

PVCI*RTKRHKNNRVKKKNEALPSAHGTPAg AS ALPEPKVR I V3Y 

SPPSAPRRPPVYYKFIEKSAEBI^NEVEYDMDEEDYAWIiEIVNE 

KRKGDCVPAVSQSMFEFLMDRFEKESHCENQKQGEQQSLIDEDA 

VCCI CMIX3ECCNSNVILFCDMCNIAVHQECYGVP Y I PEGQWLC/ 

RAHCIX3SRARPADCVLCPNKGGAPKKTDDDRWGHV\VCALW\IP 

E\VGFAim^IEPIIX;VRNIPPARWKLT\CNLCKEKGR/VGACI 

O^HKTWCYTAPHVTCAQKAGLYMKh^ 

YCDVHTPPGCIRRPI^IYGIJVEMKNGVCRKESSVKI^^ 

KKAKKAKKALAEPCAVLPTVCAP YI PPQRLfc^ 

FVBRAHSYWI*LRRL3RNGAPLIiRRIjQ^ 

KAAKEKLKYWQRLRHDLBRARliLIKLLRKREKlKREQVK^ 

MBIJUjT PKIVLJiRS VXJXJLQDKDPAR 

KHPMDFATMRKRIiBAQGYKNIiHEFEEDFDL 1 1 DNCMKYNARDTV 
FTRAAVK^RDQf^VVl^QARPJ^VDSlGIJiKASGMHLPERPAAAP 
PJlPFSWBDVDRIJ^PANRAH3^I^EOI^LIJ5^DLTCWnCSSG 
SP^KJtfVKLIJCKEIAlJiRNKXSQQliaQPL 

GPEAGEE VLPRLBTL»LQPRKR SRSTCGDS EVEEESPGKRLDAGIi 
TNGFGGAR3EQEPGGGLGRKATPRRRCASESSISSSNSPLCDSS 
FNAP KCGRGKPALVRRHTIjEDRSEIiI SCI ENGMYAKAARIAAEV 
GQSSMWISTDAAASVLEPLKVVWAKCSGYPSYPALIIDPXMPRV 
PGKHNGVTI PAP PLDVLKIGEHMCyTKSDEKLFLVLFFDHKRS WQ 
WLPKSKMVPLGIDETIDKLKMMEGR1ISSIRKAVRIAFDRAMNHL 
SRVHGEPTSDLSDID 


5950 


1166 


373 


BSRS^TMSTSQPGACPCQGAASRPAILYALLSSSLKAVPRPRSR 
CLCROimPVQLCAPHRTCRBALDVIjAlCrVAFtJiNLPSFWQL 
DQRRLLG^CVKSPIiFLLGLAQDAVTFEVAEAPVPSILKKILriEEP 
SSSGGSGQLPDRPQPSIiAAVQWLQCCLESFWSLELSPKE \ YACL 
KGPILFNPDVPGIX}AASHIGin^EAHWLCEVI£PWCPAAQGR 
LTHVLLTASTLKS I PTS I*LGDI»FFRP I IGDVDIAGLLGDMLLLR 


5951 


143 


5449 


WNVKPSIiLWQLFKFSDKEEHEQNDS I SGKTGETGVEEMIATRK 
VEQDS KBTVKLSHEDDHI LEDAGSSDI SSDAACTNPNKTBNSLV 
GLPSCVDEVTCCNLEIjKDTMGIADOT 

ESNRQLESTE FHKSNLE VVDTST FGPESNILENAICDVPDQNSK 
GimiESTKJESHETANLQDDRNSOSSSVSYLESKSVKSKHTKP 
VIHSKQNMITOAPKKIVJUUCYBVTHSKX^ 
QQNFHRPVKVRKKQIDKJBPKIO^C^^ 
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SEQ 

• ilD 
NO: 


Predicted 
beginning - ' : ,\ 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D^Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine f G^Glycine, / 
H=Histidine. I=Isoleucine, K=»Lysine, 
LHdeucine, M=Nethionine, N-Asparagine, 
P=Proline, Q-Glutamine, R=Arginine, 
S=Serine, T^Threonine , V-Valine, 
W=Tryptophan t Y=Tyrosine, X=Unknown, *=.Stop 
Codon, /=poseible nucleotide deletion, 
\=possible nucleotide insertion) 








DQTLVQI FKPDTHSLSDKSHAHPGCLKEPHHPAQTGHVSHSSQK 

QCHKPQQQAPAMKTNSHVKEELEHPGTVEHFKEEDKLKLKKPEKN 

LQPRQRRSSKSPSLDEPPLPIPDNIATIRREG3DHSSSFESKYM 

HTPS KQ CGF CKKPHGNRFMVG CGRCDDW FHGDCVGLS LSQAQQM 

GEEDKEYVCVKCCAEEDKKTE I LDPDTIiENQATVKFHSGDKTME 

CEKI^^S KHTTNDRTIGf IDDTVKHKVKI LKRESGEGRHSSDCRD 

NE I KKWQI*APIjRKMGQ?VLPRRS SEEKS E KI PKESTTVTCTGEK 

ASKPGTHEKQEMKKKKV\BKGVI*rVHPAAS^ 

HS LKDILMKPJLTDSNIJCVPBEKAAKVATKI EKELFSFFRDTDAK 

YKNKyPXLMFNLKPPKNNILPiCKVLKGEVTPDHLIP^SPEETiTlS 

KE1iAAWRRRENRHTIEMIBKBQREVERRPIT:<ITKKGEIEIESD 

apmkbqbaameiqbpaankslekpeg5bk\rkeevdsmskdtts 
qhrqhtcti^ckicigrmappvddi,spkkvxvvvgvarichsdn^ 
absianabsstsmilaseffbeekqks pkst?spaprpempgtv 

EVESTFLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYIjTEDL 
PE>S I QVGGRI S PQTVWDYVEKI KASGTKE I CWRFT PVTEEDQI 
SYTLLFAYFSSRKR YGVAANNNKQVKDMYLI PLGATDKI PHPLV 
PPDG PGUSLHRPfQjLLGLI IRQKLKRQHSACASTSHIAETPESA 
PP I ALPPDKK3K1 EVSTEEAPEEENDFFNS PTTVXjHKQRNKPQQ 
NLQEDIiPTAVEPLMEVTKQKPPKPLRFI/PGVLIGWENQPTTLBL 
ANKPLPVDDII^U/nTGQVYlX)\A(^VMEQNTV^ 
TNSKIBKTDNVEVTIXtENKEIKYKVDNISESTDKSAEIETSVVG 
SS S I SAGSIiTSLSLRGKPPDVS TEAFLTNL3 IQSKQBETVESKE 
KTIJCRQI^EDQF^INI^DNQTSNSSPCRSN^ 
t.VANTAP^PQFI^KRDPRQAAGRSQPVTTSESKIX^SCP^rC3EK 
HMLPGLSHhnCEHLTEQ INVEEKJjCS AEJ<NSCVQQSDNLKVAQNS 
PSVENIQTrSQABQAKPLQEDIIJflQNI ETVHPFRRGSAYATSHFB 
VGNTCTSEFPSKSITPTSRSTSPRTSTNPSPMRPQQPNLQKiKS 
SPPGPPFPGPPNPPPQSMFGFPPHLPPPLLPPPGFG\EA\QNPM 
VPWPPW\HLP\GQPQR1WGPLSQASRYIGPQNFYQVKDIRRPE 
RRKSDPWGRQDQQQLDRPPNRG KGDRQRFYS DSHHLKRERHEKE 
WBQESE!UmRRDR£QDKDRDRKSREEGHKDKEPARLSHGDRGTD 
GKASRDSRNVDKKPDKJPKSKDYBKXJKEREKSKHREGEKDRDRYI1 
KDRDHTDRTKSKR 


5952 


3226 


639 


PPARRSARDLPRALSMEAARPSGSWNGALCRIAXLVTLXAFKr F 
ASDACKNVTLHVPSKLDAEKIiVGRVNbKECFTAANLIHSSDPDF 
QILEDGSVYTrNTILZjSSEKRSFTIXJfSNTBNQEKKKI FVFLEH 
QTKVI»KIQRHTKEKVLiRRAKRR WAP I P CSMLENSLGPFPLFLQQV 
QSDTAQNYTI YYS IRGPGVDQEPRNL FYVERJ}TGNLYCTRpVI5R 
BQ YES FEI IAFATTPDGYTPELPLPL I IKIBDENDNYPIFTEET 
YTFTI FENCRVGXTVGQVCATDKDEPDTMHTRLKYSI IGQVPP3 
PTLFSMHPTTGVITTTSSQLDRELIDK^QLKlKV^DMIXKJYFGb 
QTTSTCI INIDDVNDHLPTFTRTSYVTS VEENTVDVE IIiRVTVE 
DKDLVITOUWRANYTILKGNENGNFKIVTI^^ 
NYEBKCX3MILOIGVVNEAPFSREAS PRSAMSTATVTVNVEDQDE 
GPECNPPIQTVRMKENAEVGTTSNGYKAYDPETRSSSGIRYKKI. 
TDPTCWVTrDEim^IKVFRSLDIffiAETIKNGrraia^J^^ 
GRTCTGTLGIILQDVNDNSPF1PKKTVI ICKPTMSSAEIVAVDP 
DEPIHGPPFDFSLF^STSBVQRMWRLKAINI>IAARLSYQKI)PPF 
GSYWPITVRDRIiGM3SWSLDVTLCIX:iTENDCTHRVDPRIGG 
GGVQIiGKWAIIiAILLG IALFFCILFTIiVCGASGTSKQPKVI PDD 
LAQQNLIVSOTmPGDDKVYSANGFTTQTVGASAQGVanVGSG 
I KNGGQET I EMVKGGHQTS ES CRGAGHHHTU)S CRGGHTE VDMC 
RYTYSEWHSFTQPRLGEESIRGHTLIKN 


5953 


330 


811 


PLLCMPDPGWYNHVKQBSB I S KESQEMDARPKJjDLGFKEGQT I K 
IXnGNITNKK^ASKPRTARGGGLSLLPPPPGGKVTIPPPSS/V 
KLPSTNHVTPPS rPKSNHGGSDADTT.T.DLDSPAPVTTPAPTPVS 
VSNDLWGDFSTASSSVPNOAPQPSNVTVQF 


5954 


32 


2130 


PP P PP PKIiANMADLEAVIiADVS YLMAME KS KATPAARASKli I ^ 
PEPSIRSVMQKYIAEPJrciTFDKIFNQOGFliFKDFCLNEIKE 
AVPQVKFYEEIKBYEKLDNEEDRIXIRSRQIY^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
HoHistldine, I = I sol eu cine, K= Lysine, 
I»=I*eucine, M»Mettaionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S-Serine, T»Threonine F V=*Valine, 
W=»Tryptophan, Y«=Tyrosine, X- Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PFS KQAVKHVQSHLSKKQVTSTLFQP YI B E I CES LRGD I FQKFM 

ESDKFTRPCOWKNVKLNIHLTMNEFSVHRIIGRGGPGEVYGCRK 

ADTGKMYAMKCLNKKRIKMKQGBTIiALHERIMI^ 

VC^ITYAFHtPDKIjCFILDLMNGGDLHYHLSQHGVFSBKEHRFYA 

TE 1 1 LGLFJ{MHHRFVVYRDLKPANILLDBH^ 

FSKJOCPHASVGTHGYMAPEVIiQKGTAYDSSA^ 

RGHSPFKQHKTXDKHBIDRMTLTVNVELPDTFSPFiKSLLEGIjIj 

QRDVSKRLGCHGGGSQBVKKHSFFKGVDWQHVYLQKYPPPDIPP 

RGEVNAADAFDI GSFDKEDTKG I KLLDCDQEL YKNFPLVI S ERW 

OQEVTETVYEAVNADTDKI EARKRAKNKQLGHEEDY7VLGKDCIM 

HGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGBSRQNLLTMEQ 

II^VBKIXJIKDKKCILPRIKGGJCQFVLQCESDPEFVOWKB^L^ 

TFIG^QRIJJlRAPKFLNKPRSGrrVBL^KPSLCHRNSNGL 


5955 


1726 


444 


KRBREFRIiAVCPLRYPSAYBSSPGTELRECGLCRSGQKFADCRR 

PANRQDVLSGWINLPVLQLTKDPIjKTPGRLDHGTRTAFIHHRRQ 

VHKRCINIWRDVGLTOVI^EIANSEEBVFEWVIcrASGWAIiAIiCK 

l^SLHGSLFPHLSI^KDLIAEFAQVTNWSSCa^RVFAWHPHT 

NKFAVALLDDS VRVYNAS STI VPS IiKHRLQRNVASLAWKPLS AS 

VLAVACO^CILIWTIJDPTSI^TRPSSGCAQVLSHPGfl 

W»PSGG3U4l^ASPVD!AAlRVWDVSTBTCVPLPWF^ 

SPDGS K II»ATTPSAVFRVWFJ\QMWTCERWPTl^GRC^rGCW S PD 

GSW^FTVIXSHPlAlYSbSFPFJfcaSBG!^ 

R«JYRHG^1VRRGLGERI.TPWSGTFVGNVP?LCL 


5956 


1705 


139 


GVGVRGAKAMATVQEKAAALNLSALHS PAHRP PGFS VAQKP PGA 
TYVWS S 1 INTLO/IXJVEVKKRRHRLKRHNIX FVGS EAVDV I FSHL 
I QNKYFGDVD I PRAKVVRVCQALMDY KVFEAVPTKVFGKDKKPT 
FEDS SCSLYR FTT I PNQDS QbGKENKLYS PAR YADALEKS 3 D I R 
S ASLEDLWENLSIjKP AN8 PHVNI SATLS PQVINEVWQEETI GRL 
LQLVDLPLLDSLLKQQEAVPKTPQPKRQSTMVNSSNYLDRGILK 
AYSDSQBDBWLSAAIDCSEYLPDQMVVEISRSFPBQPDRTDI.VK 
ELLFDAIGRYYS SREPLLNHLS D VHNGI AELLVNGKTE I ALRAT 
QIiliLKIJiDPQNREKFRRLI.YFMAVAAKPSEFKI^KESDWR^^^ 
RIFSKAXVDNKNLSKGKTDLLVLFL\MDHQKDVFKIPCT^ 
VS \ VK \ IiMAIQNGRDPNRDAG Y I YCQRIIX2RD YSNNTEKTTKDE 
1*LNLI*KTLDED£KI>S^^ 


5957 


1479 


451 


E LQVAVAWDTIjDRWKPKTKRAKRFLEKREPKLNEKI KNAML £ K 
GGNAHATVTKVIJCDVYALKKFYG\njYKJUCNlTRPFBP^ 
SKKSrKrSI,FMFGSHNKKRPNNLVlGR>rroYHVIJ3MIEI^ 
SLKDI KWSKCPBGTKPML I FAGDDFDVTEDYRRLKS LLIDFFRG 
PTVSmKIiAGI^YVWIFTAItfG^^ 
LEFJ^PSLDLVLRRTHLASDDLYKLSMKMPKAIiKP^ 
T?GTTYGRIHMQKQDI£KU^KM\KGLKKRPAERIT^HRKKS 
KRIKKKIiMELSQPLLFHCVIiLKRI IKHQS IQSFIi 


5958 


1 


3138 


AAALGMJ^WFPACQAFNIJjraKLTVYSGPKGSYFGYAVDFHlPD 
ARTASVLVGAPKANTSQPDIVEGGAVYYCPWPAEGSAQCRQIPF 
DTTNNRKI RVNGTKE P I E PK3NQ WFG\ATVKA\HKGKSCX5pVAP 
IJjFTWRNFLKP/TPEKGPVGTCYVAIQNFSAYAEFS PCGNSNADP 
EGQGY CQAGFSLDFY KNGDL I VGG PGS FYWQGQfVX TASVAD I TA 
NYSFKDILRKIAGBKQTBVAPAfiYDDSYLGYSVAAGBFTGDSQQ 
KLVAG IPRGAQNFGYVS IINSYDMTFIQNFTGBQMASYFGYTW 
VSDVNSIX3LDDVLVGAPLFMEREFESNPREVGQIYLYLQVSSLI, 

QRGKVLIYNGl^X^IJTOCPFPKFCWWASHAVPSGFGPTLRGD 

SDIDKNDYPDLI1/GAFGTGKVAVYRARPVVTVDAQLLLHPMI Iff 

I.ENKTeQVPDSMTSAACFSIJlVCAS\^^ 

S LKQKGAI BOiriiFLDNHQAIIRVFPLVIKRQKSIIQCQDF IVYIiRD 

ETEFRDKLSPINISLNYSLDESTFKEGXF^KPlI^YRENrVSE 

QAHILVDCGEDNLCVPDLJCLSARPDKHOVI IG3ENKLMLI INAR 

WEGEGAY EAELFVMI PEKADYVG I ERNNKGFRPIjS CEYKMENVT 

RMVVCDIX?NPffVSGTNYSIjGTJ?F^ 

NKDNPDSNFVSI.QXNITAVAQVEIRGVSHPPQIVLPIHNNBPER 
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SEQ 
ID 
NO: 


Predicted 
beginning } 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide , 
location 
corresponding 
to first 
amino acid 
residua of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A° Alanine, O Cysteine, D^Aspartic Acid, E=* 
Glutamic Acid, Phenylalanine, G=Glycine, 
H=Histidine, I=»Isoleucina„ K=Iiysine, 
I> Leucine, W=Methianine, N=Asparagine, 
P=Proline, Q=Glut amine , R=^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X«Dnknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=po3sible nucleotide insertion) 








BPHKEEBVGPIiVEHIYKLHNIGPSTISLrrrixKVGWPFSARDEFIj 

T.YIFKIQTLGPliQCQPNPNINPQDIKPAASP3:DTPEl>SAPLRNS 

TIPHtiVRKRDVHVVEFHRQSPAKILNCTNIECn^ 

GK SAVLKVRSRLWAHTFIjQRKNDP YALASLVS FEVKKM PYTDQ P 

AKLPEGSIAI KTSVIWATPNVS FS I PLWVT ILAILLGLLVLAI L 

TLALWKCGFroBlUlPPQEDMTDREQI*TNI)KTPEA 




1 


1166 


GTSGYAAQQLPSIiIiKFJIKFHLGTI^KVFASQHIiNHRQW 

NTIJVVDVQTSQITKIPIIJQ^REPGGV1XX)GCGIHM 

I^TGGDNPMSIAIYRI^TMPVCVGD^ 

AVS6SRDGS^WEVTDDVLTKSDJ«HNVSRW 

DIPKEimJPDNCKVRAlAAFNNKNKEI/^^ 

SKJLI^TKLPYCJIKNVCIJVYGSBWSVYAVGSQAHVSFLPPRQPSY 

NVKSVCBRBKGSGIHSVSFYBHIITVGTGQGSIjLFYDIRAQRFL 

BERLSACTGSKPRLAGBBII>KI/rrG\R^^ 

FPl^VYTHCYIWSGrrKLFVAGGPLPSGIiHGNYAGLWS 


5960 


2B53 


870 


FVWSIXKSPRPRKGPAVGAGAAHLSDPWAMTPGTANRATNPt^KK 

LDWAS INGFCEQI JJBDFEGPPIATRLLAHK1QS PQEWEAIQALT 

VIJ^CmSCGKRFHDWGKFRFI^LIKWSPKYIiGSRTSEKVK 

NKILELLYSWTVGI^EEVKIAEAYQMLKKQG\ IVKSDPKLPDDT 

TFPLPPPRPKNVI PEDEEKSKMLARLLKSSHPEDLRTUVbTKLI KE 

KVQEDQKRME KIS KRVHAIEBVNNNVKXLTEMVMSHSQGGAAAG 

SSEDL\MKBL\YQRCBRMRPTLFPTGRVDTEI)iro\EAIiAEXLOA 

NDNLTQVINL YT(X?LVRGKEVNGDATAGS I PGSTS ALLI>IjSGI*DL» 

PPAGTTYPAMPTRPGBQASPBQPSASVSLIiDDKLMSI^LSDPTrP 

PSGPSIJX^lWSFaSSDATEPPAPAIAOAPSMB^PPAQTSLP 

ASSGLDDIJ3IiLGKTIJ^SLPPBSC^VRWEKQQPTPRL 

NKSSSC3SPSSSATSIVUITVSPBPPRPPQQPVPTELSXASITVP 

LBS I KP SN I LPVTVYDQHG FH ItiFH FARDPL PGRS DVLVWVSM 

I^TAPQPIRNlVFQSAVPKVMKVKLQPPSGTEIiPAFNPIVHPSA 

ITQVMd^PQKEKVRIAYlOyr 

HGSIi 


5961 


198 


3147 


SGE PRPBPGNMATCIGBKIBDFKVGWLI/5KGSFAGVYRAES IHT 
GLKVA I KMIDKKAMYKAGMVQRVQNEVKIHCQL KHPS I LELYNY 
FBDSNYVYLVLEMCH^EMNRYIiKNRVKPFSENEARHFMHQI IT 
GMLYLKSMSILHRDUTLSMLLI/rRNMN 

KHYTIXXTrPKYISPEIATRSAK<3I^DVWSI^CMFYTLLIGRpP 
FDTDTVKNTItfKVVLADy^^ 

3LSSVLDHPFMSRKSSTKS1U5IX3TVEDSIDSGHATISTA1TASS 

STSISGSLFDKRRLLIGQPLPNKMTVFPKNKSSTDFSSSGDGNS 

FYTQWGNQBTSNSGRGRVI QDAKERPHSRYLRRAYSSDRSGTSN 

SQSQAKTYTMERCHSAEMr^VSiaiSGGGRNBBRYSPTD^^ 

KFFKEKTSSSSCSFERPI)NNQALSNHIiCPGKTPFPFADPTPQTB 

TVQQWFGNLQINARTJ?KTTETOSISPim^ 

WTDTKVKKNSD^DNAHSV^^ 

GSDPLSEQSKTRGMEPPWGYQNRTLRS ITSPLVAHRLKPIRQKT 
KKAWS tLDSEEVCVKI,VKEYASQEYVKBVLQI S SDGNTITI YY 

pngg\rgppla\drppspt\dnisr\ysf\dnlpekyvirkyqya 
s rfvql vrsks pk i tyftryakcilmens pgadfevwfydgvki 
hictedfiqvl^ktgksytlksesevnsiikeeikmymdhaneghr 
iclalbs i rsebbrktrsappppl iigrkpgstsspkalsppps 

VDSiWPTRDRASFinUWMHSAABPT 

TASGTDISSNSLKIX!LPKSAQIiIiK3VFVKNVG1^TQ\LTSGAVW 
VQFNDGSQLVVQAGVSSISYTSPNG^\TTR\YGENBKIiPDYIKO 
KLQCLSSILLMFSNPTPNFH 


5962 


20 


2447 


RVCSS SAS TASQAVfriAt>AWEE IRRLAADFQRAQ FAEATQRLS ER 
NCIEIVNKLIAQKQLBVVHTLDGKBYITPAQISKEMRDEXIIVRG 
GRVNI VDUQOVINVDLIHXENRIGDI I KSEKHVQLVLGQLIDBH 
YLDRLAEEVNDKLQESGQVTI SELCKTYDLPGNFXTQALTQRLG 
RIISGHIDLISTOGVIFTEAFVARHKARIRGLFSAITRPTAVNSL 
ISKYGFQEOl^YSVLF^VNSGRIiRGTVVGGRGPKAVFVPDIYS 
RTQSTWVDS F*FR0NGYLEFDA1»SRIjGI PDAVS YI KKRYKTTQLL 



413 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corr e sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
1 A- Alanine, C=Cys te *-ne, D^Aspartic Acid, E~ 
Glutamic Acid, ^Phenylalanine, G^Glycine, 
H^Biatidine, I=Isoleucine, R=Lyeine, 
Mieucine, M=Methionine , NaAsparagine, 
P«Proline, Q=Glutamine, R^Arginine, 
S~Serine, T-Threonine, V= Valine, 
W=Tryptophan, Y^Tyrosine, X~Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\ -possible nucleotide insertion) 








PLKAACVGQGIiVDQVRASVEEAI SSGTWVDIAPIJUPTSIiSVEDA 
AI LLQQVMRAFSKQASTVVFSDTVVVSEKF\ INDCTK L FRELMH 
QKAEKEMKNNPVHLITKKDLKQISTLKSVSTSKKDKKDKRRRKA 
TBGSGSMRGGGGGNAREYKIKKVKfCKGROTDnsDDKSQSSHTGK 
KKPEI S FMFQDB lEDFLRKHIQDAPEBFlSSIiABYLI KPLNKTY 
LBVVRSVFMSSTTSASGTGRKRTIKDLQEBVSKIiYNNIRLFBKG 
MKFFADim2AALTKBI>LKSVCTDlTN^^ 

AITSEIRKKILSKLSEKTKVAIjTKIjHNSLNEKSIKDFISCLDSA 
ARACI>lMVJ«GDKKRJ^QILFQIffiOAnAEQLKVT^PALILHLT 
SVLLFQFSTHSMLHAPGRCVPQI IAPLNSKIPEDQHALLVKYQG 
LVVKQLVSQSKKTGQGDYPIJ^IJ)K3QBDVASTTR^ 
SIKDLVLKSRKSSVTEE 


5963 




1130 


PWNPQDFPGNRGLM6\QKGBIGPP\GQQ3KKGAPeMP\GLMGSN 
GS PGQPGTPG3KGSKGE PG I QGMPGAS GliKGEPGATGSPGBPGY 
MGLPGI QGKKGDKGNQGEKGIQGQKGENGRQGIPGQQGIQGHHG 
AKGERGEKGE PGVRG AIGS KGESGVDGU4GPAGPKGQPGDPGPQ 
GPTOLIX5KPGREFSEQPIRQVCTDVIRAQIiPVLI»Q5GRIRNC3l)H 
CLSQHGSPGI PGPPGPIGPEGPRGLPGLPGRDGVPGLVGVPGRP 
OTRGIJCGLP<3RNGEKGSC«FGYPGF^PPGPPGPEGPPGISKEG 
PPGOPGLPGKIXSDHGKPGIQGQPGPPGICDPSI^FSVIARRDPF 
RKGPNY 




3 


2147 


SCRTRGRIjSPLQPRBAGSSRGSRARSEPPRPGGMEEACQVQTTK 

RGDPHELPJtn:FLQYASTEVDGBRYrrrPEDFVQRYI^LYm)PN8S 

PKIVQLLAGVAlXnTOXSLISYQEFIAFESVLCAPDSMFlVAFQL 

FDKSGNGEVTPENVKEIFXOTIIHHHIPFl^DCEFIPJ^HFGHNR 

KFMLNYTEFTQFLQEU^lARCAFALIGDKSKSGMISGIiDFSDI 

MVTIRSHMLTPFVEENLVSAAGGSISHQVSPSYFNAFNSLI^JNM 

ELVRKI YSTLAGTRKDAEVTKEEFAQSAI RYGQATPLEID I L YQ 

3^LYNASGPOiT!MXBRIAPlAEGAl>PYNI^LQROJ3SPGLGR 

PIWLQIAESAYRFTLGSVAGAVGATAVYPIDLVKTRMQNQRGSG 

SWGEIjMYKNSroCFKKVLRYEGFFGLYRGLIPQLIGVAPEKAI 

KLTVKDFVRDKF^RRI^SVPLPAEVIiAGGCAGGSQVI FTNPLEX 

VKIRLQVAGBITTGPRVSAIJJVLRDIjGI fglykgakacflrdi p 

FSA I YF PVYAHCK1jLIiADENGHV<GGLNl^^ 

TPADVI ETRI^VAARAGQTTYSGVrDCFRKIIi\RBEGPSAFWKG 

TAARVFRSSP<JPG\VTLVTYEI»LQRGFYIDFGGLKPAGSEPTPK 

SRIADLPPANPDH IGGYRIATATFnG I KNKFGLYLPKFKS PSVA 

WQPKAAVAATQ 


5965 


1 


1498 


MVTWLYRFLPTSNMAAXIjRSIXPPD^^ 
CGSYC^GAKASPLPGKMAMGLMCXSPvRELL^ 
SQWLGKPLTTRLIiFPAAP<X!CRPHYLPLAASGPRSLSTSAlSFA 
EVQVQAPPVVAATPSPTAVPBVASGBTADWCnAAEQSFAELGL 
GSYTPVGI.IQNLI^FMHVDLGI»PWWGAIAACTVFA^ PPLIV 
TGQREAARDINHLPEIQKFSSRIREAKIiAGDHIEYYEASSBMAIi 
YQXKHG I KLYKPL I LP VTQAP I F I SFF IALREMANLPVPSLQTG 
GLWWFQDLTVSDP I Y I LtPLAVTATMWAVLELGAETG VQS SDLQW 
MRNVTRMMPL I TLPITMHFPTAVFMYWLSSNLFSLVOVSCLR IP 
AVRTVLKI PQRWHDLDKLPPREGFLES FKKGWKNAEMTRQLRE 
REQRMRNQLELAARG PLRQTPTHNPIiLQPGKDNPPNI PSS\SSS 
SSKPKSKYPWHDTLG 


59ftt 


102 


1925 


RSKQVMARLTKRROADTKAIQHLWAAIE I IRNQKQ 1ANI DRITK 

YMSRVBGMHPKETTRQI^IAVKIXSLIVOT 

GYWLPGDE IDWETElflHDWYCFBCHLPGEVLI CDLCFRVYHSKCL 

SDEFRI^SSSPWQCPV(^lKXiamaQE«GTYIiRFTVSRMKE 

raidlnkkgkdn khpmyrrxvhs avdvptiqekvnegkyrs yee 
fkadaqli*lhntv1fygai^eqadiap^ 
kncfyianapjkdnwfcypci pnheldwakmkbfgfwpakvmqkk 
dnqvdvrffghhhqrawi pseniqditv^ 
cdei^lw3rflregrfwks knedpjgekeabss i sstsneqlkvt 
qkprakecgrrnosvbpkxeepbpf/teavsssqeiptmpqpiekv 
svstqtkki^assprmibrstot™^ 
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Predicted end 
nucleotide _ 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Amino acid segment containing signal peptide 
(A;; Alanine, C=Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, F=*Phenylalanine, G-Glycine , 
H=Hietidine, I=Isoleucine, K> Lysine, 
L=Leucine, M-Methionine, N=Asparagine, 
P^Proline, Q^Glutamine , R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosinc, X- Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 

\=po3sible nucleotide insertion) 

KDRHKSDHKRETBRVVREAI*EKiiRSEMEB 

EMDRKCKQVKEKCKEE FVEE I KKIATQHKQLISQTKKKQWCYNC 
BBBAMYHCCWNTSYCS IKCX3QEHWHAEHKRTCRRKR 



SEQ 
IB 
NO: 



Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



5967 



102 



1925 



}^KQVMARI>TKRRQADTKAIQHLWAAIEIIRNQKQIANIDRITK 
YMSRVHGMHPKETTRQLSUWKD^ 

GYWLPGDEIDWKTKNHDWYCFECHLPGKVLICDLCPRVYIISKCL 
SDEFRLRDSS SPWQCPVCRS1 KKKNTNKQBMGTYliRFlVSRMKE 
I^IDI^^GKDNKHPMYRRI>VHSAVDVPTIQEKVNBGKYRSYBE 
FTCAOAQLLLHirnrrFYGADSEOADIARMLYK^ 
KNCPTfLMiARPDWWPCYPCIP^ELDW^^ 
Dl^VD VRPEGHHHQRAW IPSKNI QD I TVN IH2LHVKRSMGWKKA 
CDEI iK hHQRFLREGR FVJKS KNEDRGBEEABSS I S STSNEQLKVT 
QEPRAKKGRRNQSVE PKKEBPE PETEAVS SSQEI PTMPQP IEXV 
SVS TQTKKLSASS PRMIll^RSTXJTTNIX^CQS^KIHDKYTKI FNDP 
KDRMKSDHKRETERVVREA1»KKI*R3EMEEBKRQAVNKAV^ 
EMDRKCKQVKEKCIOEEFVBEIKELATQHKQIjI SQTKKKQWCYNC 
BBEAMYHCrwirPSYCSIKCTOElWHAEHKRTCRRKR 



5968 



1288 



VRPPRRGGAPPTVIiTPGRQQGVtTjGPQRPGSBPDIPARGQPHPP 
RPVGVSTS AQAQVQP PAMHRRRLALGIXJFCLIJVGTBIjSVLWVYIi 
ENWLPVSYVPYYLPCPEIFNMKiaYia^KPrX)PVVWSQYPQPKI> 
LEHRPTQLLTLTPWI^IVSEX^FNPBLIiQHIYQPLNLTIGVTV 

eavgn/helesabefwrgyiwhy/yiftd^ 

lssi piqahsl^weets^mrmetisohxakrahrevdyi j fcldvd 

MVFRHPWGPET'LGDIjVAAIHPSYYAVPRQQFPYKRRRVSTAFVA 
DSEGDFTYGGAVFGGQVARVYEFTRGCHmiLADiavNG IMAAWR 

esshi^rhtisnkpsf^j^peylwddrkpqppslklirfstldk 
discijis 



DVGFNI KRKRCDLDVFLESPRKPSGRR0RAPEKQRRIAANKCLC 
TGVREGEPPS/TTSQKVKEAGRDPTYJiIVVliFGISirGGLFYTI 
FXELFSSSSPSKIYGRALBKCRSHPEVIGVFGBSVXGYGEVTRR 
GRRQHVRFTEYVKDGLKHTCVKFYIEGSE PGKQGTV YAQVKENP 
GSGEYDFRYIFVEIE3YPRRTIIIEDNRSQDD 



5969 



1126 



533 



SQDNlGltRLJjQKHGWKliGQGLGKS LQGRTDP I PIWKYDVMGMG 

RMEMKI^YAEDATBRRRVLBVEKEDTEKLRQK^KDYVDKEKAIA 

KALKDLRANF YCEL CDKQYQ KHQK FDNHINS YDHAHKQRLKDLK 

QREPARNVS SRSRKDBIOKQEKALRRIjHELAEOJRKQABCAPGSGP 

MFKPTTVAVPBEGGl^DIGDBSATNSGTGATASCGIiGS E PSTDKG 

GPFTAVQJTfmrai^APGlJ^G«ISFGlK^^ 

SFAKZ<APVKLBSIASVFKDHAEEGTSEIX5TKP3EKSSDQGIX5KV' 

G1^IX?SSNI*IX5KKBDEDPQDGGSLASTI^KLKRMKRBEGAGA^ 

PEYYHYIPPAHCKVKJPNFPFTAFMRASEQMDGMn-raPKNAPES 

KKGSSPKPKSCI KAAASOGAEKTVSEVSBQPKETSMTEPSEPGS 

KAEAKKALGGDVSIXJSLESHSQKVSBTQMCBSNSSKBTSXATPA 

GKESQEGPKHPTGPFFP VLS KDBS TALQWPSELL IFTKAEPS IS 

YSCNPLYPBFKLSRNKDARTKGTEI^KDIGSSSKDHIjQGLDPGE 

PNKS KEVGGE KI VRSSGGRMDA PASGSACSGLNKQE PGGSHGSE 

TEDTGRSLPSKKBR^SGKSHRHKKKKKHKKSSKHKRKHXA^ 

SSKABSGEKSKKRKKRKRKKNKSSAPADSBRGPKPEPPGSGSPA 

PPRRRRRAQDDSQPJISLPABEGSSGKKDBGGGGSSSQDHGGRKH 

KGELPPSSCQRRAGTKRSSRSSIiRSQPSSGDEDSDDASSHRLHQ 

KSPSQYSEEEEEEDSGSEHSRSRSRSGRRH5SHRSSRRSYSSSS 

DASSDQSCYSRQRSYSDDSYSDYSDRSRRHSKRSHDSDDSDYAS 

SKHRSKRHKYSSSDDDY^LSCSC^RSRSRSHTRERSRSRGRSRS 

SSCSRSRSKRRSRSTTAHSWQRSRSYSRDRSRSTRSPSQRSGSR 

KRSWGHESPEKRHSGRRDPI RSKIYRSQSPHYFRSGRGEGPGKK 

DIXSSGDDSKATGPPSQNSNIGTGRGSEGIXSPBDKNSVTAKLIiL 

EKIQSlirofEI^SVSEBVO^TPNlGAGPKLKDPPQGY^^KIiPPS 

LGNKPVLPL IGKLPATRKPNKKCEESGL3RGEEQEQS ETEBG PP 

C^snnLFGHO^\3EETTGPLIiDPPPEESKSGBVTADHPVAPLG 

PPAHPDCntIi9BPTISHNYJJDP3DG!m»ESliDSSSQPGPVESSL 

LPIAPDIiEHFPSYAPPSGDPS IBSTDGAEDA\SIjAPLBSQPITF 



"557TT 



316 



4712 



415 
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ID 
NO: 


ri ecu. cue a 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


lr reCUUC lcu wjivj. 
nucleotide 
location; t : 
corresponding 
to first 
amino acid 
residue of 
amino acid ■ 
sequence 


(Alanine, C=Cysteine, D=Aspartic Acid, E== . 
Glutamic Acid, F=» Phenylalanine, G=Glycine, ' 
H=Histidine, I^Isoleucine, K=Iiysine, 
L^Leucine, M»Methaonine, 6T=Asparagine, 
Psspxoline, Q^Glut amine, R=Arginine» 
S=Serine, T=Threonine, V= Valine, 
^Tryptophan, Y=Tyrosine, X "Unknown, *-Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TPEEMBKYSKI^AAQQHIQQQLIiAKQVKAFPASAALaPATPAI, 
QPIHIQQPATASATS ITTVQHAJUjQHHAAAAAAAIGIHPHPHPQ 
PLAQVHHIPQPHLTPISLSHLTHSI I PGH PATPLASHP I HI IPA 
SA1HPGPFTFHPVPHAALYPTLLAPRPAAAAATALHLHPLI.HPI 
FSGQD LQHP PSHGT 


5971 


53 


2149 


SFIjYFVGVDMDNPIGNWDGRFDGVQLCS^ I 
PF^VTQERRPPKLAFMSRGVGDKOSSSHNKPKATGSTSDPGNRN 
RSSLPYTLNGSSVDSQPQSKSKNTWYlDKVABDPAKSbTEISTO 
PDR3SPPLQPPPVNSLTTENRFHSLPFSLTKMPNTNGSIGHSPL 
SLSAQSVMEBLNTAP VQBS P PIiAMPPGN SHGLRVG SIiAEVKENP 
PFYGVIRVf IGQPPGLNEV1AGI iBLEDB CAG\ CTDG*TF/ REGTRY 

f>*T»/"i a t wifc t warn i (# v tfP/mTmnnDkOT nm r^nmr fm/ntoT n tmdhv 

FXuUaKKAItf VKJjKSCRPDS RFASIjUP VSNU I ERLNS LAI WEAY 
LSKWEENTPTQKWRXEGLE IM IG\KRKGIQGHYNS CYtiDSTTiF 

YVCATKIMKLRKII>EKVBAASGFTSBEKDPEBFr^ILKHHILRV 
BPLLKIRSAGQKVQDCYFYQI FMEKNEKVGVPTlQQLLEWS FIN 
SNLKFAEAPS CI*I IQMPRFGKDFXIiFKKI FPSLKLNITDLLEDT 
PRQCRICX3GLAMYKCRBCYDDPni SAGK1ZQPCKTCNTQVHLHP 
IQOJIHKYNPVSIiPKDLPDWDWRBGCIP^^ 
VAPVKYGIO)DSAWLFFDSMADRI>3GQlSGFIfIPQVTPCPKVGBYIi 
KMSLEDIJlSI»DSRRIQCC2tf*RL^^ 


5972 


440 


1761 


ILIJVGSPSPRDQCSC^QSSGGDKEIiVTRGCTPSTAVVS PSAMTQ 
EPFREBLAYDRMPTLKRGRQDPASYAPuAKPSDLQLSKRLPPCF 
SHKTWVFSVIMGSCJiLVTSGFSLYIiG^FPAEMDYLRCAAGSCI 
PSAI VSFTVSRRNANVI PNFQII^FVSTFAVTTrCklWFGCKLVL 
NPSAlNINFNLIbliLLIiKLLMAATVI IAARSSERDCKKKKGSMS 
DSANILDEVPFPARVLKSYSWBVIAGISAVIGGI 1ALNVDDSV 
^PHXSVTFFWILVACFPSAJASIWAABCPNKCLVEVLIAISSI. 
TSPLLFTASGYLSFSIMRIVEMFKDYPPAIKPSYDVLLLLLLLV 
LLLQA/ GPQHGHRHPVRALQGQCKAAGCILGHPERPAGAPGWGG 
GQBPPEGVRQGESI^RRGANGPVTPRRGNRVAAPSLAPGMEl'H 
NP 


5973 


65 


2007 


NGDGKDLFGHIWAWRSNGI I SNFRRS PHAGMAEDE PDAKSPKTG 
GRAP PGGABAGEPTTLLQRLRGTI S KAVQNKVEG ILO^TVQKFS D 
NDKLYLYLQLPSGPTTGDKS S EPS TLSNBEYMYAYRWIRNHLEK 
HTDTCIiPKQSrVYDAYRKYCESLACCRPl^TANPGKI IREI FPDI 
KARRLGGRGQSKYCYSGIRRKTLVSMPPliPGLOLKGSESPEMGP 

nVi f/lf KJJBJjV JSAAUAIj IT_Un/in»rC 1 LuLKCr coo 1V5 VJ\tuf uJUUUn 

I*ISAR3AHWWIiKAMGLAKBDEHAPRKRSSKPia^L 
JCKPERItAQPPKDLEARTGAGPLARGBRKKSVVES SAPGANNLQV 
NALVARLPLLLPRAPRSLIPPI PVSPPlLAPRLSSGALKVATLP 
LS5RAGAPPAAVP I INMILPTVPALPGPGPGPGRAPPGGLTQPR 
GTENREVG IGGDQGPHDKGViGlTAEVPVS EASGGAPPAKAAKQD 
IEOTASDAKRKRGEPLKKSGGSGERN3TPLK3AAAMBSAQSSRIj 

pwetwgsggegwsaggaerpgpmgeabkgavlaqg\qgdgtvsk 
ggrgpgsqhtkeabdkiplvpskvsvi kgsrsqkeafplakgev 
mapqgmkdlkehvlqsslsqehkdpkatpp 


5974 


4293 


2200 


lglqmhttsgr ihqamvtslnednesvtvewiengdtkgk\b ID 

LESIFSLNP\DL\VPIX3EIBPSP\ETPPPPASSAKVNIcnmN^ 

TV\ASIKNDPPS\RDNRWGSARARPSQFPEQFSSAQQNGSV\S 

DISPVQAAKKEPGPPSRRKSNCVKEVEKLQEKREKRRLQQQEIiR 

BKRAQDVDATNPMYB I HCM IRDFRGSLDYRPLTTAD P I DEHR I C 

VCVRKRPUJKKBTW^LDVITIPSK^^ 

ENQTFRFD YAFDDS APNEMV Y RFTARPLVKT I FERGMATCFAYG 

QTGSGKXHTMGGDFSGKNQDCSKG I YALAARDVPLMLKKPNYKK 

LEI^VYATFFE I YSOKVFDIXNRKTKIJmiEDG 

REVXCVE3DVLKLIDIGNSCRTSGQTSANAHSSRSIIAVFQI ILRR 

KGKLHGRTSLIDLAGNERGADTSSADRQTRJLEGAB^ 

ECIPJU^RNKPHTPFRASKLTQVLRDSFIGENSRTCMIATISPG 

MAS CENTI^NTLRYANRVKELTVPPTAAGDVRP IMHHPPNQI \DD 

LETQWGVGS S PQRDDLKIjLCEQN BEKVSPQLFTFHEAVSQMVEM 



416 
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SEQ 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, O Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F-Phenylalanine, G==Glycine, 
H-Hietidine, I«Isoleucine, K=I*ysine, 
L^Ijeucine, M=Kethionine, N-Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=*Threonine, V= Valine, 
W^Tryp t ophan , Y=Tyrosine, X=Unknown, *=*Stcp 
Codon, /=:pos3ible nucleotide deletion, 
\t=possible nucleotide insertion) 








EEQVVBDHRAVFQESIRWLEDKKALLEMTEEVDYDVDS YATQLE — 
AI LEQ KI D I LTELRDKVKS FRAALQBEEQAS KQ I N PKR PRAL 


5975 


4293 


2200 


LGIXX<HTTSGRIHQAMVTSLNEI^SVTVEWIENGDTKGK\EID 
LESIFSLNp\DL\VPIXSIEP.<?P\ETPPPPASSAKVNKrVKNRR 
TV\ASIKNDPPS\RDNRWGSARARPSQPPKQPSSAQQNGSV\S 
DISPVQAAIOCEFGPPSRRKSNC^KEVBKIjQEKREKRRIiQQQELR 
EKRAQDVDATNPNYE I MCMIRD FRGSIiDYRPLTTADP I DEHRI C 
vcvrkrplnkketqmkdld VITI PSKDVVMVHEPKQKVDLTRYI* 
EKQTFRFDYAFDDSAPNEMVyRETARPLVETI FERGMATCFA YG 
QTGSGKTHTMGGDFSGKNQIX^S KGIYAIiAARDVEIjMLKKPNYKK 
LELQVYATFFEIYSGKyFT)LTiNRKTKLRVLEDGKQQVQVVGliQB 
REVKCVEDVLKLI D IGNS CRTSGQTSANAHSSRSHAVFQ I ILRR 
KGKI^KFSLIDLAGNERGADTSSADRQTRI^GA^ 
ECIRALGRHKPHTPPRASKLTQVLRDSFIGENSRTCMIATISPG 
HASCWTIjm^YANRVKEmVDPTAAGDVRPIMHHPPNQI \DD 
MSTQWGVGSS PQRDDLKLLCEQ^EEEVSPQIjFTFHBAVS QMVEM 
EEQWEDHRAVFQBS IRWLEDKXALtiEMTEEVDYDVDS YA1XJLE 
MLEQKIDILTELRDKVKSFRAALQEEEQASXQINPKRPRAL 


5976 


20 


2949 


VHHLHLTRVSWVNLDriLRIAQQMGIKTLNIiVIiG\LKRA\LEF 
PEVSWMEVKDPNMKGAMLTNTGKYAI PTIDA\EAYAIGKKEKPP 
FLPEBPSSSSEEDDPIPDELLCLICKDIMTnAVVIPCCGNSYCD 
ECIRTALliESDBHTCPTCHQND VS PDALIAtoKFLRQAVNNFKNE 
TGYTKRURKQI*PSPPPP I PPPRPL IQRNLQPLMRS P I SRQQDPL 
MIPVTSSSTHPAPS ISS LTSNQSSIiAPPVSGHPSSAPAPVPDI T 
ATVSISVESEKSIXSPFRDSDNKILPAAAliASEHSKGTSSIAITA 
I^EEKG YQ VP VXGTPS LLGQSI»LHGQLI PTTOP VRINTARPGGG 
RPGWEHSNKLGYIiVS P PQQ IRRGERS CYRSIHRGRHHSERSQRT 
GGPSLPATPVFVPVPPPPLYPPPPHTLPLPPGVPPPQPS PQFPP 
GQP\ PPAGYSVPPPGFP PAPANLSTPMVSSGVQTAHSNTIPTTQ 
APPLSREEPYREQRRLKEEKKKiCSKI^EFTNDFAKELMEYKKIQ 
KERRRS FSRS KS P YSG S S YSRS S YTYSKSRSGS TRSRS YSRS FS 
RSHSRSYSRSPPYPRRGRGKSRNYRSRSRSHGYHRSRSRSPPYR 
RYHSRSRSPOAFRGOSPNKRNVPQGETEFJEyFl^YREVPPPYDM 
KAYYGRSVDFRDPPEKERYREWBRKYREWYEKYYKGYAAGAQPR 
PSAHRENFS PERFT»PLNIRNSPFTRGRREDYVGGQSHRSRNIGS 
ITOBKI^ARDGHNQKDNTKSKEK^ 

KGEBSEGFI^EIJ^ETSRKSREPTGVEENKTOSLFVLPSRDDAT 
PVRDEPMDABS ITFK8VSEKDKPBRDKPKAKGDKTKRKNDGSAV 
S KKEN I V KPAKG PQEKVOG \ D VRDItLDI^NLXQLKKPKEE TP KDL 
TILNHHKPLRRMKKSL \ E P P\ EKLTIJJQGK\TPRNKTSQRGKS E 
EGLFQRCQ IRKANN 


5977 


1363 


1336 


FLEDRGQVLSHFO^I^IjHS inhilhpgagvaagpatgw/reylt 

PVLKES KFKETGVI TPEE PVAAGDHLVHHCPTWQWATGEELKVK 
AYLPTGKQFLVTKNVPCYKRCKQMBYSDELEAI ibeddgdggwv 
IJTYHNTG I TO I TEAVKE !TljE2iKDNIRX>QIX^ 
EAADMEEYEES<5I»LETDEATLDTRKIVEACKAK^ 
TRTYDLYI'j^DKYY(yrPRLWLFGYDB<)RQPLTVBHMYEDISQDH 
VKKTVTIENHPHIiPPPPWCSVHPCRHAEVMK3CIIETVAE(^ 
GVHMYLL I FliKFVQAVIPT I E YD YTRHFTM | 


5978 


160 


3213 


RDGARRWGGCQSPLTWAXGFYRRFDIATSGRRI^GC/TAEPAGRQ 
RPRRE PEAMDEQS VES IAEVFRCFI CMEKLRDARLCPHC3KLCC 
FSCIRIiWLTEQRAQCPHC3^LQIiRBLVNCKWABBVTQQLDTLQ 
LCSLTKHEENEKX>KCBNKiiBKLSVFCWTCKKCICHQCALWGG^ 
GGinFKPIAEIYBOHVTKVNEETVAKLRKKLMELISLVQEVERNV 
EAVRNAKDERVRE I RNAVEMM TARLDTQIiXNKIjI TLMGQKTSLT 
GETEJLLESLI^EVEIIQI^(^KSEI*ISKSSEI1^FQQVHRKPM 
ASFVTTPVPPDFTSELVPSYBSATFVLENFSTIiRQRADPVYSPP 
I^^GIX^miJCVYPDGNGVVRGYYI^VF^ 

VEMVHO^ PTKN I IREFASDFEVGECWG YNRFFRLDIJjANEG 

YLNP^JNI>TViraFQVRSPTFFO;KSro 

IHNlkERiTIBLSRTQraRDLSPPDNHI^PQ*© 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F»Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine, 
L= Leucine, M=Methionine, N=Asparagine, 
P~Proline, Q=Glutamine, R«Arginine, 
S=Serine # T=Threonine , V=Valine, 
H=Tryptophan, Y^Tyrosine, X=Onknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








CSDMLLER\GP YSAS \VRBAKEDEBDEEKIQNEDYHEELSDGDL 

DLDLVYED EVNQLDGS S 5 SAS STATS NTEEND I DEETMSGENDV 

BYNNME3^EEGEIiMEDAAAAGPAOSSHGYVGSSSRISRRTHLCSA 

ATSSLIiDIDPLrLXHLIJ5LKDRSSIE^LWGLQPRPPASliLQPTA 

SYSRKDKDQRKQQAMWRVPSDLKMLKRLKTQMAETO 

TLSEI KBSSAASGDMQ^^SLFSADQAALAACGTENSGRLQDUSME 

IJAKSSVANCYIRNSTNKXSNSPKPARSSVAGSI^LRRAVDPGE 

NSRS KGDCQTLSBGS PGSSQSGSRHSS PRAL IHGS IGDILPKXE 

DRQCKMJ)SDA\AA'AVFSGLPAVEKRRKMVTIX3A^ 

QMTDLENNSETGBL<)PVLPEGA3AAPEEGMSSDSDIECI>TENEE 

CBBHTSVGGFHDSF>!VMTQPPDEDTHSSFPDGEOIGPBDLSFNT 

DBNSGR 


5979 


212 


3665 


LPDWl^ Y JL^I» KIJjAFGPAFLDTEVFVTGQS PTPS P TDAYLN ASE 
TTTLS PSGSAVI STTTIATTPSKPTCDEKYANITVDYLYNKBTK 
LFIAKIJKJVWBlsP^OG^^ 

pdktlildvppgvewpvhcx^Xqveqpdstiwlkwkjtietstc 

DTQN ITYRFQCGNMI FDNKEIKLENLE PEHEYKCDS EILYNSHK 
FTNASKIIKTDFGSPGEPQIIFCRSEAAHQGVITWIPPQRSFHN 
FTLCYIKETKKDCIJ^KI^IKYDLQNLKPYTKYV^ I A 
KVQRNGSAAMCOTTTKSAPPSaVWNNrrVSMTSDNSMHVKCRPPR 
DRMGPHKRYHI^VBAGNTLVRl^Hiaira)raVKDIiQYS 
AYFHITODYPGEPFIIjHHSTSYNSKALIAFIAFIjIIVTSIALIjVV 
LYKI YDLHKKRS CNLDBQQELVBRDDBKQLMNVEP IHADILLET 
YKRKIADEGRLFLAE FQS I PRVFS KFP I KEARKP FNQN KNRY VD 
ILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPR 
DBT VDDFWRH2 WEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGT 
RAFGECCCKDI,TKHKRCP\DYIIQKIiNIVNKKEKATGREVTHIQ 
FTSWPDHGWEDPHIiLLKLRRRVNAFS^^ 

TGTY IG IDAMLEGLEAENKVD VYGYVVKLRRQRCLMVQVEAQY I 
IxIHQALVEYNQFGETEVNLSEUIPYLilNMtCKRDPPSEPSPLEAE 
FQRLPSYP^RTQHIGNQE\ENKSKNRNSNVIPY1)YNRVPLKHB 
LEMSKESEHDSDESSDDDSDSEBPSKYINASFIMSYWKP\EVMI 
AAQGPLKETIGDF^QMIFQRKVKVIVMLTEIiKHGDQEr CAQYWG 
EGKO^GDIEVDIJCDTDKS STYTLRVEK 

YTNWSVEQLPAEPKELISHIQVVKQKLPOKNSSEGNKHHKSTPL 

LIHCPJKisQQIGIFC3a*LMI*I^SABTEEVVDIF0AWO^ 

GMVSTFEQYQFXYIDVIASTYPAQNGQVICKN^^ 

KVKQI3ANCVNPLGAPE1CLPPJVKEQA 

ASPALNQGS 




3 


2363 


DAWGCKLRRLR FTYGTQTR VS I^LPGQ YELVHTLVAHQGNWBTI 
PEBDLEVQENNBDAAHDLTELEVTMHHA^ 
PTVDVLGDLVNDFIJ>VITYALHKDEr,SERDEQELOElRKYFSFP 
VFFFKVPXLGSE 1 1 DS STRRMESERS P L YRQLIDLGY LS SSHWN 
CX5APGQDTKAQS MLVEQSE KLRHL6TFSKX)VLQTRLVDAAKALN 
LVHCHCLDI F1NQAFDMQRDLQITPKRLEYTRKKENELYESLHN 
lANRKQKEMKDMIVETLNTMKEKLr.DnATNMBFKDVIVPKNGKP 
VGTRE I KCCIRQIQELI ISRLNQAVANKL I EE VD YLRES FVGTL 
ERCLQSLBKSQDVS VH I TSNYLKQI LKAAYHVEVTFHSGS SVTR 
MLWEQIKQIIQRITWVSPPAITLEWKRKVAQBAIBSI*SASKLAK 
SIC^QFRTRIJSrSSHEAFAASLRQLEAGHSGRLBKTBDLWLRVRK 
DKAPRlARI^LESRSLQDVIjLHRKPKl^ 

wf^ran?Dr , aT.vc\nrDDr>wifHt)JMiriT.2iT t?wuvMDOT.t>WTTWDT irnr trr» 

n^ivjrtl* r LAJj IVo V V tr trUanXVnKt iMuWjCtCli i IHKjljrAlllSJtljVLIljflVj 

S VIDYNYGGGSS IAVLLIMBRLHRDLYTGLKAGLTLETRLQIAI, 
DVVEGXRFI^SQGLVHRDIKLKbmiLDKQNRAKITDLGFCKPEA 
MMSGS IVGTP1 HMAPSIi FTCKYDNSVDVYAPGI LFWY I CSGSVK 
LPEAFERCASKDHLVJNNVRRGAR PBRLP VFDEECWQIiMEACWDG 
DP LKRPLIX5 rVQPMLQG IMNRLCKS \NS KQPNRGLDDST 


5981 


1 


2519 


GRRHSAAMERPWGAADGI^RWK^IASIi^ 

DAPPPPAAPLPRWSGPIGVSWGLRAAAA\GGAFPRGGRWRRSAP 

G\EDEBCGRV^U)FVAKLANOTHQHVFDDIiRG 

I LVLTTPWPLVIMTFGQSKLyRSBDYGKNFKDITDLINNTFIR 
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| SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AcAlanine, C»Cysteine, D=»Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H^Histidine, I=»Isoleucine, K=Lysine, 
I*=»Leucine, M=Methionine, tNAsparagine, 
P=Proline, Q^Glutamine, R=Arginxne, 
S=Serine, T=Threonine, V=Valine, 
tf=Tryptophan, Y^ryrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








TKFX5I^IGPEKSGKVVLTAEVSGGSRGQRIFRSSDFAKNFVQTD 
LPFHPLTQMMYS PQNSD YLIALS TENGLHVSKNFGGKWHE IHKA 
VCLAKWGSDNTIPFTTYANGS CKADLGALELWRTSDLGKSFKTI 
GVKIYSFGLGGRFLFASVMADKI»TTRRIHVSXDQGOT 
SVGQEQFYS I1AANDDMVFMHVDEPGDTGFGTI FTSDDRG I VYS 
KSLDRHLYTTTGGETOFTNVTSLRGVYITSVI^ 
DQGGRWTHLI^PENSEa^TAXNKNRCSLHIHAS YS ISQKLNW 
MAPLSEPNAVGI VIAHGSVGDAI SVMVPDVYXSDDGGYSWTKML 
EGPHYYTILDSGGI IVAIBHSSRPlNVIKFSTDEGQCWQrYTFT 
RDPIYFTai*A3EPGARSM»IS IWGFTBSFLTSQWVS YTIDFKDI 
LBRNCEEKDYTIWIAHSTTIPKDYEn)GCILGYKEQFr,RLRKSSVC 
QNG3DYWTKQPS I CLCSLBDFLCDFGYYRPHNDSKCVEQPELK 
GHDIiEFCLYQREEHLTTNGYRXI PGDKCGGGVMPVRBVKDLKKK 
CTSNFLSPEKQNSKSNS VPI 1 1AIVGLMLVTWAGVL I VKKY VC 

ggrpl\^ysvi^hvaea\ngvix3vealdtash^ 
dedi*le 


5982 


56 


2316 


ATR?PRGSSHCO?QFSRTASAAPGRSNMLRIPVRKAIiVGIiSKSPK 
GCVRTTATAASNLI EVFVDGQSVWVBPGTTVLQACBKVGMQIPR 
FCTHERLSVAGNCRMCI.VEIEKAPKVVAACAMPVMKGWNI LTNS 
EKSKKARBGVMBFXiXiANHPI^CP lOXy^ECDLQDQSMMFGNDR 
SRPI.EGKRAVBDKinGPLVKTIMTRCIQCTRCIRPASEIAGVDD 
LGTTGRGNDMQVGTY IE KMFMS ELSGNI I DI CPVGALT3KP YAP 
TARPWETRKTES IDVMDAVGSNI WSTRTGBVMRIIiPRMHEDIN 
EEWISDKTRFAYDGLKRQRLTBPMVRNEKGLLTYTSWEDAIiSRV 
AGJMSFG^IOWAAlAGGLVDAEAIiVA^ 

VPPTAGAG TDI»RSN YX>I*NTTIAX3 VEEADVVLLVGTNPRFEAPL F 
NARlRKSWLHNDLKVAIiIGSPVDLTYTYDHLGDSPKILQDIASG 
SHP FSQ VIiKE AXKPMVVLGSSALQRNDGAAI LAAVSS IAQKIRM 
TSGVTGI^KVWWIIJIRIASQVAAIJDLGYKPGVKAIRKNPPKVT^ 
LLGADGGC I TRQDLP KDCF 1 1 YQGHHGDVGAP IADV I1»PGAAYT 
EKSATYVNTEGRAQC^KVAVTPPGLAREDWKI I RALSEI AGMTJj 
PYDTL\DQVRNRU^SPlIL\^DDIBG\AinrFXXJA^ 
QQIiLADPLVPPQLTMKDFYKTDS ISRASQTMAKCVKAVTEQAQA 
VBEPSrC 


S9S3 


248 


1763 


EARGIX5GRRRiIRASGRRAGRGF^\AGLKSCGQRAVPKRAVARGG 

RQ\YSAAXALLEPAGSBIADDLSILYSNRAACYLXEGNCSGCIQ 

DCKP^I^IPFSMKPI 1 LRRAMAYKTLKQYGKAYVDYK , rVJ J QIDC 

GLQIJVNI)SVNRI^RXIiMELIX3PNWREKIiSLIPAVPASVPLQAim 

P AKEM I S KQAGD S S SHRQOGI TDEKTFKALKBEGNG<^V1TOKNYK 

DALS KYSECLKINNKECAI YTNRAIiCYLKLCQFE EAKQDCDQAL 

QLADGirVK7^FYRRAIiAHKGLKNYQKS L IDLNKVI LLDPS 1 1 EAK 

MKI.KEVTRLI^TiKDKTAPFNKKKEKRKIKIQKVNEGKEEPGRPA 

GEVSTG<XASEKGGKSSRSPKDPFJO^PJJVKPNNAYEFGQIINAL 

STRKDKEACAHLIAITAPKDI^MFLSNlO^EGm'FIJ^ 

LIE1GD PS LVYQHIA YLSKAERFKMMLTLI SKGQKELI EQLFBDI* 

SDTPNNHFTLEDIQALKRQYEX 


5984 


755 


1193 


SSVCMACIWSNLGKKQRSVSFI^GLMRVSTC^ 

TGDVGRR I CRliLVGLFTKGDTSS KRVHPFS PGPCFLI>CDU\R VG 

SSPKINVSPFYQN\QTSTQRSCTVFVWQRCSLVGPFQVTVFTMY 

FHHSLRS 1 SRFSSG 


5985 


22 


1408 


RRVARPGTAEPAKARRTVPJ^RARRDLAGAERKAGVSERGDSGR 

RRPNPS I PSAAAGMSHIQI PPGl»TELLQGYTVEVLRQQPPDIiVE 

FAVF/YFTRLREARAPASVLPAATPRG^IiGHPPPEPGPDRVAnAK 

GD3ESBEJ)EDLEVPVPSRFMRRVSVCAETYNPDEEEEDTDPRVI 

HPKTDEQRCRTjQEACKDTLLFKPn^DQBQI^ 

BHVIDQGDDGDNFYVIBRGTYDI LVTKIWQTRS VGQYDNRGS FG 

ElJU^YNTPRAATIVATSFX^LWGIiDRVTFRRIIVKNNAKKRKM 

FF^FIFJSVPLIJ(SLEVSPJWKI^VIGBKrYKR/DGBRIITQGE 

KVADSFYI I HSGBVS ILIRSRTKSNKDGGWQEVBIARCHKGQYF 

GEIALVTNKPIU*A^YAVGDVXCI,V^^ 

NISHYEEQLVKMFGSSVDU3NLGQ 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
Corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A==Alanine, C»Cysteine, D^Aspartic Acid, B=; 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N-Asparagine, 
P^Proline, Q=Glutamine, R=Arginine, 
S =Serine , T=Threonine , Valine , 
W=Tryptophan, Y=Tyrosine, X=>Unknown, *~Stop 
Codon, /rrpossible nucleotide deletion, 
\=possible nucleotide insertion) 


5986 


1806 


484 


DAWKSTS LTFHWKLWGRHRGRRRGLAHPKNHLS PQQGGATPQV P 

SPCCRFDSPRGPPPPRI^LIjGAIjMAEDGVRGSPPVPSGPPMEED 

GLRWTPKSPLDPDSGI^CTIiPI^FGGQSGPEGERSLAPPDAS I 

LISNVCS IGDHVAQEI»PQGSDLGHAEEAERPGEK\AGOHSPLRE 

E3J\rrCV0SIIiDRFl^\YGSLlPLSrDEVVEXLEDIPQQEPSTP 

SRKGLVI^LlQSYQRMPGNAMVRGFRVAYIG*H\n^ 

QNWL^VMNMYGDLVMiy^ 

VKRWTKNVDlFNKBLIJilPniLEVHWSI.ISVDVRRRTITyFDSQ 
RTI^RRCPKHIAIWIiQAEAVKKDRIiDFHQGWKGYFKMNVARQNN 
DSDCX5AFVLQ YCKHLALSQPFS PTQQDM P KLRRQIYKR LCHCKL 
TV 


5987 


1806 


484 


DAWKSTSLTFHWKLWGRHRGRRRGIJVHPKWHI^PQQGGATPQVP 
SPCatFDSPRGPPPPRLGrJ^AIMAEO^GSPPVPSGPPMEBD 
GLRWTPKS PLDPDSGLL8 CTLPNGFGGQSGPEGERSLAP PDAS I 
LTSNVCSIGDHVAQEIjFQGSDIXJMAEEAERPGEKVAGQESPIiRE 
EHVTCVQS ILDE FLQT\ YGS LI PLSTDEVVEKLEDI FQQE FSTP 
SRKGLVLQL I QSY QRM PGNAMVRGFRVA Y KRHVLTMDDLGTLYG 
QNVlLNDCfVMNMYGDL VM DTVPK K\ VHPPNS PFY\DKLRTKG YDG 
VKRWTKNVDI FNKELLLIPIHLE VHHSUSVDVRRPTITYFDSQ 
RTLNRRCPKH rAKYLOAKAVKKDRl*D?HQGWKGYFKMNVARQNN 
DSDCGAFVLQYCKHIiALSQPFSFTQQD^ 
TV 


5988 


1292 


410 


FKKYFLS FLGLLE S SHSRDR I HNLVLMFLLATHNLVWW FTCR FQ 
RII)CIYIiNAGIMPNPQI^IKALLFGLF^ 

LQEWETDVFGIIFriJRELEPLLCHSDNPSQLIWTSSRNARKSN 
FSLHDFQHSKGKEPYSSSKYATTDLLSVALNRNPMCK3GLYSNVAC 
PGTALTNLTYG ILPPF I WTIJjMPAI LliLRFPANAPTLTP Y 
ALVWLFHQKPRS LN PL I KYLS ATTGFGRNY1MTQKMDLDEDTAE 
KFYQKLLELEKH I RVTIQKTDNQARLSGS CL 


59B9 


194 


2610 


AMDPPQHSQHVLBQLNC^RQLGliLCDC^ 

ACSEYFKMLFATIXJKDVVHLDISNAAGLGQ^ 

nvddvlVavatplqmqdi ITACHALKSLAEPATSPGGNAKAIiAT 

EGGDKRAKEEKVATSTLS RLEQAGRSTP I GPSRDLKEERGGQAQ 

SAASGAEQTKKADAPREPPPVELKPDPTSGMAAAEAKAALSBSS 

EQEMBVEPAREGEEEQKBQEEOEBBGAGPAEVKEEGSQLBNGBA 

PEENENEESAGTDSGQELGSBARGLRSGTYGDRTESKAYGSVIH 

KCEIXrGKEFTHTGHFK^IRiHTGEKPFSCRECSICAPSDPAACK 

A1IEKTI IS PLKPYG CE ECGKS YRL I SLLNI iRKKRHS G KARYRCRD 

OGKLFTTSGNIjKRHQLVHSGEKPYQCDYCGRSFSDPTSKMRHLB 

THDTBKEHKCPHCDKKFNQVGNI,KAHLKIHIADGPLKC!RECGKQ 

FTrSG^KRJiliRIHSGEKPYVCIHQQRQFADPGALQRHVRIHTG 

BKPCQCVMOGKAPT0ASSI.1AHVRC2HTCEOT 

SQIANHIREOIt^RPHICCSVCSKAFVNVGDLSKHI I IHTGEKPY 

LCDKDGRGFNRVDNLRSIIVKTVHQGKAGIKII^ 

VDDM\n^LATEALAATAVTQLTVVPVGAAVTADB^ 

VKQVQEEDPNTHILYACDS CGDKFliDANS LAQHVR IHTAQALVM 

FQTDADFYQQYGP^3GTWPAGQVLQAGELVFRPRIX?AEGQPJ^LAE 

TSPTAPECPPPAE 


5990 


2 


4700 


FGPG P DSGGGARGSGWGS RSCAP YGTLGAVSGG EQ VLLHEEAGD 
SGPVS t<SRLGPS LRDKDLEMEELMIiQDETTiIiGTMOS YMDASLI S 
LIEDFGSLGEVEMSLPDPSWDFSPPSFLETSSPKLPSWRPPRSR 
PRWGOSPPPOOR QTWRRRREvn^F^nnTT.anRT.nNrv^s t pnpp 

±ri\.Ttvr\4*jir xr c%/yiv»3*JijJ&JSBOC( Vrior ouu JJj/\V7XVwJJi , » LVOOl true tr 

MHI»ACPEEEDKATAAEMAVPAAGDES I SSLSELVRAMHPYCLPN 
LTHLAS LEDELQEQPDDLTLPEGCVVLEI VGQAATAGDDLEI P V 
VWQVSPGPRPVLJJDDSLETSSALQLLM^ 
CEEKEGLSLNSEEKLDSAClJjKPPvEVVSPVVPKE 
G^QRARmRKKKSKBQPAACVBGYARRIiRSSSRGQSTVGTEVTS 
QVDNLQKQPQEELQKESGPIiOGKGKPRAWARAWAAALENSSPKN 
LERS AGQSS PAJ^GPIjDL YPKIjADTI QTNPI PTHL SLVDSAQAS 
PM PVDS VBADPTAVGP VLAGP VPVDPGLVDLAS TS SBLVE PLPA 
EPVLINPVLADSAAVDPAVVPlSDNl^PVmVPSG^ 
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to first 
amino acid 
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amino acid 
sequence 


Predicted end 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-»Alanine, 0=Cysteine, D»Aspartic ' Acid, B«.;'"; 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H»Histidine, I=Isoleucine, K~I>yslne, 
I^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R^Arginine, 
S- Serine, T= Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X~Unknown, *=«Stop 
Codon, /^possible nucleotide deletion, 
\-poesible nucleotide insertion) 


- 






DPVPNDLTPVDPVbVKSRPTDPRRGAVSSALGGSAPQljLVESES 
LDPPKTIIPBVKBWDSLKIKSGTSATTHEARPRPLSLSEYKHH 
RQQRQAETBKttSPQPPTGKWPSLPETPTGIADIPCLVI PPAPAK 
KTALQRS P ETPLK I CLVP VGPS PAS PSPE PPVS KPVASS PTEQV 
PSQEMPLI>ARPS PPVQSVS PAVPTPPSMSAALPFPAGGLGMPPS 
LPPPPI^JPPSLPLSMGPVI^DPFTHYAPLPSWPCYPHVSPSaYP 
CLPPPPTVPLVSGTPGAYAVPPTCSVPWAPPPAPVSPYSSTCry 
GPLGWGPGPQHAPFWSTVPPPPLPPASXGRAVPQPKMESRGTPA 

gppknvlplshapplslglpghgapg^teptkvevkpvpasphpk 
ekvsalvqspqmkaiacvsaegvtvekpasep^ 
kpplpatkavptprqstvpklpavhparlrki>sfi*ptprtqgsb 
dwoafisexgieasdlsslleqfkkseakkecpppapadslav 
gnsgg vd i pqekrpldrlojvpelanvagtlt ppatpphqx.wkpla 
avsllakaks pkstaqegti4cpbgvt3akiipaavrlqegvhgps 

RVHVGSGDHD YC\ VRSRTPPKK\ MPAMjI PEVGSRWNVKRHQDI 
TIKPVLSIXiPAAPPPPCIAASREPLDHRTSSEOADPSAPCLAPS 
Sr*LSPEASPCRira4NTRTPPEPSAKQRSMRCYRKACRSASPSSQ 
GWQGRl^GRNSRSVSSGSNRTSEASSSSSSSSSSSRSRSRSLSPP 
HKRWRRSSCSSSGRSRRCSSSSSSSSSSSSSSSSSSSSRSRSRS 
PS PRR3SDRRRRYSS YJWHDHYQRQRVI^KHRAIEE31RVVFIGK 
IPGRMTRSHLKQRFSVFGKIEECTIIIFRVQGDNYGFVTYRYAEE 
AFAAIBSGHKIJiQADBQPFDLCPGGRRQFCKRSYSDLJDSNREDF 
DPAPVKSKFDSLDFDTI*LKQAQKNIiRR 


5991 


334 


| 1379 


RbSSHFSQCS PS I YC\TKFDKQGNVTS FBRKKTKI»YQBLGljQAR 
DLRFQHVMSITVRNNRI IMRMEYLKAVITPBCIjI*IIjDYRNLNLK 
QWLFR2LPSQI*SGEGQLVTYPLPFEFRAIEAIiI^YWINTLQGKI» 
SII^PLILETI^AI J GDPKHSSVDRSKLHILLQNGK5;.LSELE , rDI 
. KI FKESIIiEILDBEELLEELCVSKWSDPQVFEKSSAGIDRAEEM 
ELLLENYYRIiADDLSNAAREliRVLIDDSQS I IF INIiDSHRNVMM 
RLI^LT^TFSLSLFGLMGWFGMNLESSLEEDHRIFWLITGI 
MFMGSGLIWRRLLS FLGR/LARSS IAS YGMKDMVHGG I VEGL 


5992 


2 


609 


MPDFRLVCGVSGSGFPGGRQGQATEWRPLRPWNGAMEKIiRRVI, 
SGQDDKEQGLTAQDSQINL/ SEVLDASSIjSFNTPJLKWFAICFVC 
GVFFS I I/3TGLLWLPGG I KLFAVFYTLGNIiAALASTCFLMGPVK 
QLKKMFBATRItlATlVMLLCTIFTLCAA 
FLSMTWYSLS YI PYARDAVI KCC8SI»LS 


5993 


1650 


594 


ABGLGSWAVWAGlUSWAGRHMEAGGATCAXfi 
SVAMDM FQKVK K IGRGTYG WYKAKNRHTGOLVAT »KK T R EDLEM 
EGVPSTAIRE I SLLKELKHPNIVRLIiDVVHNERKLYIjVFE FI»SQ 
DI*KKYMDSTPGSELPIiHTj I KS YLFQLLQGVS FCHSHR VTHRDLK 
PQNT.T.INELGAI KLADFGIJVRftFGVPT.RTYTHEVVTLWYRAPET 
LIATRFYTTAVDIWS IGCIFAEMVTRKALFPGDS \BIDQ\LFRI 
FRMLGTPS1S>TWPGVTQI»PDYKGSFPKW^^ PEG 
RDLLMQIMYDPSQRITAKTALAHPYFSSPEPSPAAP^YVI^^ 
RH 


5994 


394 


1934 


AGBVQLHVW1RGMRIQPQ/KAAAI IDLDPDFEPQSRPRSCTWPf, 
PRPEIANQPSKPPEVEPDI/5EKVHTEGRSEPILLPSRLPEPAGG 

LAQIYEWMVRTVPYFKDKGDSNSSAGWKNGIRIINLSLtlSKEIKV 
HNEATGKSSWWMLNPEGGKSGKAPRRRAASfTOSSSKI^RGRSKA 
PKKKPSGI^APPEGATPTSPVGIIFAKWSGSPCSRraF^ADMWTT 
FRPR S SSNAS <I V<;TRT.SPIiR PR^KVT AKKI PAS VSS YAGGVPPT 
l^EGLE IiLDGLN LTSSHSLLS RSGI*SGFS LQHPG VTGPLHTYSS 
SLPS PAEG PLSAGEGCFSSS QALEAI»LTSDTP P P P ADVLMTQVD 
PII^QftOTLIJJ^LPSSSKIATGVGI^ 

SM I APPPVMASAP I PKALGTPVLTP PTEAASQDRMPQDLDLDMY 
MEKLEm^IDNI ISDLMDEGEGLDFNFEPDP 


5995 


2 


2437 


RPPGPGPASGAWIXrrRARGSAAFVPPI*PRPPSRGARRRRRIiPGR 
GVAALRRGPGSAPGLPRGRAERSAAGSGRGPSREERGAAAAAAA 
AEMMEEIiHSL\DP\RRQEI»LEARP\'rGliGVSKGPLNSESSNQSL 
CSWSl^DKEVETPFJCKGllDQRNRKRKAB^ 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment contaxning signal peptide 
(A^Alanine, C»Cysteine, D=Aspartic Acid, E- 
Glutamic Acid. F»Phenyl alanine, G=Glycine, 
HoHistidinc, Iwlsoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=*Proline, Q=Glutaraine , R^Axginine, 
S=Serine, T=Threonine, v=Valine, 
W^Tryptophan, Y=Tyrosine, X=u"nknown, *t=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ISDYFERRVEQPLYGI^SAAKEATI^OSALPTIJ^VMLAKPRL 
DTEQLlAQRGAGLCFTFVSAQQNSPSSTGSGNTEHSCSSQKQISI 

qhrotNqsdltiekisalenskwsdlekkegriddllrancdlr 
rqi\deqqkmlekyk\erlnrcpdneprnfliekskqekmacrd 

KSMQDRLRIiGHFTTVRHGAS FTEQWTDGYAFQNL I KQQERINS Q 
REEIERQRKMUuXRKPPAMGOAPPATNEQKOj^KSKTWG 
TLAEYHEQEE I FKLRLGHLKKEEAB I QAELBRLKRVRNLHTREL 
KR IHNEDNS QFKDHPTl^DRYIJjIJILI^RGGFSE^nf KAFDLTEQ 
RYVAVKIHQLNKNWRJJBKKENYKTOACRBYRIHKEIiDHPRIVKL 
YDYFSIi>TDSFCrFVI^YCEGNDLDFYI^ 

IVNALKYLNEI KPPI IHYDLKPGCTILLVNGTACGEI KlTDPGLS 
KI WDDDSYNS VDGWBLTS QGAGTYWYIiP PECPWGKB PPI-GCSMK 
VDVWSVGVI FYQCLYGRKPFGHNQSQQD ILQENTIIiKATEVQFP 
PKPWTPEAKAFIRRCLAYRKBDR IDVQQLACDPYLLPH I RKS V 
STSSPAGAAIASTSGASNNSSSN 




1612 


981 


DQG^CLLGI^LTLEFGILEFDPSWIGSWTQR/SVfVSWRSRPGCB 
LFS I WFGS IVNEG YLKTSAS EGEEFCIYNRNPNACS YGVAVGVL 
AFLTCIjLYLAIiDVYFPQ I SSVKDRXK\ AVLSGHP WSGEPHPAA 
FWAFLWFTGDSCYL\ANQWQVS KPKDNPLNEGTDASPGRPSPFS 
FFS I FTWSLTAALAVRRFKDIiS FQEEYSTLFP\ASAQP 


5997 


1612 


981 


DdOACIMI^TLEFGII^FDPSWIGSWTQR/SWVSKRSRPGCE 
LFSIWFGSIVNEGYI^SASE<^EBFCIYNRNPNACSYaVAVGVl, 
AFI,TCIiYIiAIJ>VYPPQISSVKDRKK\AVr^aHTWSOEPHPAA 
FWAFLWFTGDS C YL\ANQW0V S KPKDNPLNEGTOAS PGRPS PFS 
FFS I FTWS LTAALAVRRPKDLS FQBEYSTLFP VASAQ P 


5998 


1612 


981 


DQQACIJiGI^TLEFGILEFDPSWIGSW'liQR/SWVSWRSRPGCB 
LFS I WFGS I VNEGYLNS ASEGEEFCI YNRNPNACS YGVAVGVL 
AFLTCLLYLALD VYFPQ I S S VKDRKK\ AVLSGHP WSGEPHPAA 
FWAFLWFTX5DS CYL\ANQWQVS KPRDNPLNBarDAS PGRP 5 PFS 
FFS I FTWSLTAALAVRRFKDLS FQEE YSTLPP \ASAQP 


5999 


2 


1790 


RPPMEKARRGGDGVPRGPVLHIVWGFHHKKGCQVEFSYPPLIP 
GDGHDSHTLPEEW KYLPFLALPDGAHNYQEI7IVFFHLPPRNGNG 
ATVFGI S CYR \Q IFAKALKVROAD ITRETVQKS VCVLS KL PLYG 
IJjQAKLQLITHAYFF^KDFSQISILKBLYEH^ 
OWLGI^PRDLVLHFRHKGLILFKLIIJ J EXI^FTtSPVNKLVG 
AIOTVl^LFPGMIEHGLSDCSQYRPRKSMSBDGGLQESNPCADD 
FVSASTAIWSHTNLGTIRKVMAGITOBDAAMKTBEPI^QVBDSS 
KGQEPNDTNQYLKPPSRPS PD3SESDWBTLDPS VLEDPWLKERE 
QLGSIXJTNLFPKDSVPSBSLPITVQPG^NTGQVVLIPGLISGLE 
EDQYGMPIAIFTKGYLCLPYMAIjOXJHHIiIjSDVTVRGF^ 
LFRCXJKHLSDAI VE VEEALIQ I HDPEIiRKLIiNPIT 
VRHVTENRDDV FLDGTG WEGGDEWIRAQFAVY I HALLAATTjQLV 
LFR I VWAKIQGNVMVTT\SRNWQTGK\AVGQSVGGAFS \ SAK 
TA\MS S WLSTFTTS TSQSLTEP PDBKP 


6000 


101 


1561 


TEP CRTAEfl CTATMS ENNKNS LKSSLRQLKCHFTWNLMEG ENS L 
DDFEDKV *r x Kl 1^ i^NRE FKATMClff jI^YLKHLKGO^EAALE CLR 

kabeliqqehadoj^irslvtwgnyalfvyyh 

vkhvc£kt^spyri3speij3cehx^rlkcggnqnerakvcfek 

alekkp knpeftsgiaias yt? t.dnwppsql^ idpi jiqai ulnpd 

nqylkvllalklhkmrebgeekg^ek\lvb^ 

lrsaa\kpyrgkx>epdkaiellkkalbyip\nnaylhcqigccy 

rakvfxjvmnijibngmygkrkllelighavahlkkadeandnlfr 

vcsi lasiihaladqyemeyyfqkefskeltpvakqllhlrygn 

fqlyqmkcrokaik^fiegvkihqksrekekmkdklqkiakmri, 

skngadsealhvlaflqeiotkmqo^edserc 

SWNGB 


6001 


176 


1038 


AFAHSPSRGHRKTHXHTPRHTPRCTMAESHLQSSLITASQFFE 1 
WUlPOAlX;£^YLEGKEl^NLIQKU}0ARi<XAGUiI>S PEMKTFVD 
QYGQRDDGK1GI VEIiAHVIiPTEENFIiI»LPRCXX2I»KS CE\EFMKT 
WRKYDTDHSGFI ETEELKNFLKDLLEKANKTVDiyrKLAE YTDLM 



422 



WO 01/53312 PCT/US00/34263 



SEQ 
ID 
NO: 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine. C=Cysteine, D^Aspartic Acid, B=* 
Glutamic Acid, F=Phenylalanine, G=Glycine f 
H=Histidine, T=*Isoleucine, K=Ly»ine, 
L=>Leucine, M^Methionine, N-Asparagine, 
P^Proline, Q=Glutamine, R=Arginine , 
S=Serine f T=Thxeonine , V^Valine, 
W=Tryptophan , Y=Tyrosine, X^TJnknown, *=,Stop 
Codon, /^possible nucleotide deletion. 
V=possible nucleotide insertion) 








LKIiFDSNWDGKLKLTiiMAH (J, P VQRNFIjIjKFQG I KMCGKE FNKA 
PBLYDQDGNGYIDENELDALLKDLCEKNKQDLDINNI'ITYKKNI 
MALSDGGKLYRTDLAIiILCAGDN 


6002 


977 




LAPPGGGLHIPPRTPLSHSRPPPSHHAPHPSPliPLPPADLHPHS 
SMAQRSDLLBIjIX^LUTORVVWSBlDENUrRQSGLNRDVGSLDP 
EDLPXiYKEKLKVYFS PGHFAHGSDRRMVRLEDLFQRPPRTPMS V 
EIKGKNEBLIRBQ/VIfVRRYl)RNBITIWASBKSSVMKXCKAANP 
EMPLSPTISRGIAmJ*SyYLGLLPFIPIPfiKFFFCFI.PNriXNRT 
YPPFS CS CLNQLIA WS KWL IMRKSLI RHLE BRGVQWFWCLNE 
BS DFEAAFSVGATGV ITDYPTALRHYLDNHGPAARTS 


G003 


140 


4090 


GKIiRAFRGPH^LICKRI C0YKSFDDEESVDGNRPSSAASAFKVP 

APKTSGNPANSAJRKPGSAGGPKVGAGASKSGGAGAVDBDDFIKA 

F-rDVPSIQIYSSRELEETIJ^KIREn.SDDiamPTORAWAIiKKIR 

SLLVAGJUVQYDCFFOHLRLl^GALKLSAKDIxRSQVVREACITVA 

KLSTVLGNRFDHGAEAJVPT^ J 

RHTHVPRLI PLITSNCTS KS VPVRRRSFEFLDLLLQEWQTHSLK 

RHAAVI»VBTI KKG IHDADAEAR VEARKTYMGLRNHFPGEAETL Y 

NSLEPSYQKSIiQTYLKSSGSVASLPQSDRSSSSSQESLNRPFSS 

IWSTANPSTVAGRVSAGSS KASSI^SLQRSRSDIinWAAAGAK 

AHHAAGOSVRSGRUSAGALNAGSYASICTTSDKIJDGT^ 

RAKliSAPLAGMGNMCADSRGRSRTKMVSQSQPGSRSGSPGRVLT 

TTALSTVSSGVQRVLVNSASAQKRSKIPRSQGCSREASPERLSV 

ARSSR I PRPS VSQGCS REASRESSRDTSPVRS PQPIiASRRHSRS 

TGALYAPBVYGASGPGYGISQSSRLSSSVSAMRVIiN T rGSDVEnSA 

VADAIiLLGD IRTKKKPARRR YES YGMH SDDDANSDASS ACSERS 

YS3R1^5IPTYMRQT\BDV\ABVIiNRC?iSSiN5fS ERKEGLLGLQN 

LLKNQRTLSRVEIiKRLCEIFTRMFADPIIGKRVFSMFLETIiVDFI 

QVHKDDiXtfWLFVIiLTQIXKK^ 

FPNDLQFNII^RFTVDO/TQTPSLKVTCWTA^ ETLAKQMDPGD 
FINSSETRLAVSRVITWTTEPKSSDVRKAAQSVLISLFBLNTPE 
FTMLIiGAI>PKTFQ1X3ATKIiL^ 

RSPANWSS PLTSPTNTSQNTLS PSAFDYDTEKMNS BDIYSSLRG 
VTEAIQNP3FRSQFJ)^BPLKRDSKKDDG0SMCX3GPG\MSDPRA 
GQDATDSSQTAI»\DNKASLLHSMPTHSSPRSRDYNPYtrYSDSlS 
PFNKSALKKAMFBDDADQFPDDLSIiDHSDLVAELLKRI>SN}INER 
VEERKI AIi YBLMKLTQEES FS VWDKHFTCFIIjIXXJLRTDGDKBPT 
IRAXALKVLRE I LRHQPARFKN YAB LTVMKTLEAIIKDPHKEVVR 

saeeaasv\iatsi\speck:ikvix^iiqtadypihlaaikm^ 
kvjotvskbtlnli^peimpgliqg^ 

HAVIGDELKPHLSQLTGSKMICLLNLY I KRAQTGSGGADPTTDVS 
GQS 


6004 


140 


409B 


GKLRAFRGMRRL I CXRICDYKS FDDEBSVDGNRPSSAASAFKVP 
APKTSGNPANSARK^SAGGPKVGAGASKKGGAGAVDEDDPIKA 
FTDVPSIQIYSSREI£ETLNKrREII^DDKIiDWIX3RANALKKIR 
SII*VAGAAQYIXrFFQHLRLIJXSA^ 

HLSTVLGNK5THK3AEArVPTIJ^^ I 

RHTHVPRLIPLITSNCTSKSVPVRRRS FEFLDLLLQF^QTHSLE 

R1IAAVLVET I KKG I H DADABARVEARKTYMGLRNHFPG EAETIj Y 

NSLEPS YQKSLQTYIiKSSGS YAS LPQS DRS S SSSQRSLNRP FS S 

KWSTANPSTVAGRVSAGSSKASSLPGSLQRSRSDIDVNAAAGAK 

ABHAAGQSVRSGRIjGAGAIJ^SYASI^DTSDKLDCTASBDGRV 

RAKl^APLAGMGNAKADSRGRSRTKMVSQSQPGSRSGSPGRVLT 

TTAI^TVSSGVQRVLVNSASAQKRSKIPRSQGCSRBAS PSRLS V 

ARSSRIPRPSVSOGCSRKA3RBSSRDTSPVRSFQPLASRHHSRS 

TGALYAPBVYGASQPGYGISQSSRIiSSSVSAmVIiNTGSDVEEA 

VADALLLGD I RTKKKP ARRRYESYGMHSDDDAN5DASSACSERS 

YSSRNGSIPTYMRQTXEDVN^VIiNRCASSNWSERKBGLLG^ 

LLKNQRTIiSR VE LKRbCE I FTRMFADPHGKRVFSMFLETLVDFI 

QVHKBDIX}DWkFVIiLTQLLKKMGADLI^ 

PPNDI£FKII^FTVDQTOTPSIJCVK^ 

FINSSETRLAVSRVITWTTEPKESD\rRKAAQSVLISLFEIjNTPE 
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residue of 
amino acid 
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Predicted end 
nucleotide ; 
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corresponding 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A~ Alanine, C-Cysteine, D^Aspartic Acid, E«= 
Glutamic Acid, P^Phenylalanina, G=Glycine, 
H=Histidine, I^Isolaucirwj, K=LyBine, 
L=Leucine, M-Methionine, N-Asparagine , 
P=Proline, Q=Glutaraine f R=Arginine, 
S*»Serine, T^hreoiiine , V=Valine f 
W=Tryptophan , Y=Tyxosine, X=Unknown, *-Stop 
Codon, /=possible nucleotide deletion, 
Vpossible nucleotide insertion) 








FTMLLGALP KTFQIXIATKLLHNHLRNTGNGTQSSMGS PLTRPTP 

RSPANWSS PLTSPTNTSQNTLSPSAPDYBTENMNSED I YSSIiRQ 

VTBAIQNPSFRSQEDMKEPLKRDSKKDDGDSMC3GGPG \MSDPRA 

GGDATDS SQTAL\DNKASU^HSMPTHS S PRS RBYNP YNYSDS I S 

PFNKSALKEAMFDDDAIXJFPDDLSIjDHS 

VEFJtKIALYELMKLTQRESFSVWDEHFCT^ 

IRAliALKVLRB ILRHQPARPKNyAELTVMKTIiKAHKDPHKEVVR 

SAEKAASV\IATSI\SPEQCIKVI^PIIOTAI>YPINIAAIKMQT 

KVI ERVS KETLNLLL PB IMPGLI QG YDNSE SSVRKACVFCLYAV 

HAVIGDBLKPHI*SQI*TGS KMKIiLNIj YT KRAQTGSGGAD PTTDVS 

60S 


6005 


133 


5955 


RSSGRRQBQLGQFPGRERKGMASGLGS PSPCSASSBBBEMDAIiL 
NNSLP PPHPBNEKDPBBDLSETBTPKLKKKKKPKKPRDPKI PKS 
KRQKKFJRMI*I>CRQTiGDS SGEGPEFVEEBEBVALRSDSEGSDYTP 
GKKXKXKLGPKXEKKSKSKRKREBEBDDDDDDDSKEPKSSAOIX 
FJ)WGMEDIDHWSE^DYRTLTNYKAFSQFVRPLIAAKNPKIAVS 
KMMMVLGAKWREFSTNNPFKGSSGAS^ 

TBVAPPPPPVBVPIRKAKTKEGKGPNARRKPKG3 PEVPDAKKPK 
PKKVAPL KI KLGGFGSKRKRSSSBDDDI»DVBSDFDDAS INSYSV 


SDGSTSRSSRSRKKI^RTTKKJCKKGEEEVTAVIX^KroHQDYCKV 
CQQGGBI ILCDTCPRAYHMVCI*DPDMEKAPEGKWSCPHCEKEGI 

qweakeiwsegeeileevggdleeeijdmhmkfcrvckdggellc 
cotcpssyhihcr^pplpeipkgewlcprctcpaijcgkvqkili 
wkmgqppsptpvprppdm)pntpspkpi>egrperqffvkwqgms 
ywhcswvselqlelhcXqvmprnyqrkndmdeppsgdfggdebk 

S\RFKXNKDPKFAEMEERFYRYGIKPEW\MMIHRIIiNHSVDKKG 

HVHYLI KWRDIjP YDQAS WESEDVBIQDYDI»FKQS YWNHRELMRG 

E^RPGK30fKJCVICUlKliBia > PETPTVDPTVK^ 

TmPYQMEGLNWLRFSWAI^TDTILADEI^LGKIWrAVFLYSL 

YKEGHS KGP FLVSAPLSTI IN\WEREPEMWAPDMYV\VTYVGDK 

DSRAIIRE^FS\FEimiRGGKKASRMKKEASVKPHVI J LTSYE 

L I T I DMAI I *G S I D W ACL I VD KAHRL KNNQS KFFR VLNG Y S LQi IK 

UiTGTPLQMNLEBLFHWiNFLra^ 

QIKKIJIDMIX3\PHMLRRLKADVFK^P3KTEI^V 

KKYYK\YTLHSB3njKALN\ARGGGNQVSL^ 

LFPVAAMBAPKMPNGWYIXSSAIiIRASGK^ 

RVLIFSQhTTKMIjDLLEI)FLEHEGYKYBRIIX3GITGNMRQKA 

FNAPGAQQFCFIiLSTRAGGIiGINLATAD'rVI I YDSDWNPHNDIQ 

AFSRAHRIGQNKKVMIYRFVIRASVEF^ITO^ 

RPGIX3SKTGSMSKQEIJ5DIIiKFGTP^LFKDFATDG^DNKEGED 

SSVIRTDDKAIERlJ^RNQDETBI7rBLQC^BYI*SSFWAQYVV 

REBEKGEEEEVEREI I KQEESVDPDYWEKIiLRHHYEQQQBDLAR 

ND5KGKRIRKQVNYNDGSQEDRDWQDD0SDNQSDYSVASEEGDE 

DFDFJiSEAPRRPSRKGLRNDKDKPLPPLI^VGGNIEVUSFNAR 

QRKAFIJOtfMRYGMPPQDAFTO 

MRHI*CEPGADGAET FAIXJVPREGIjSRQHVL.TR I GVMS LIRKKVQ 

KFKIIWGRWSMPEI^KVEHWKKMSCPGSPSPKTPTPSTPGDTQP 

NTPAPVPPAFJXSIKIEF^SIJCEEE^IEGKKEVKSTAPETAIECT 

QAPAPASEDEKVVVEPPEGEEKVEKAEVKERTEEPMETKPKGKG 

A7UJVEKVEEKSAI DIiTPI WEDKBE KKEEEBKKEVM LQNGETP K 

DRTOBKQKKNIKQRFMFNlADGGFTBIiHSLW^EERAATV^ 

YEIWHRRHDYWIJaAvjl INHG X AK WUU X UHiii'K iAl SMS fcfr JU>BH 

NRGNFLEIKNKFLARRFKLLF^ALVIBEQL^ 

PSMALOTRFAEVECLAESHOHLS KESMAGNKPANAVLHKVLKQL 

EELLSDMKADVTRIaPATIARI PPVAVRLQMSERNJLSRLANRAP 

EPTPQQVAQQQ 


6006 


1 


965 


DNDb^RNTVHRHEPPVTAEPIRLLAENEDVVVVDKTSSrpVHPC 
GRFRHNTVI FIIXSKFJIQLKEI^PLHRIJDRLTSGVI^FAKrAAVS 
ERIHEQVRDRQLEKE WCKVEGKFPTEEVTCKEPI LVVS YKVGV 
CRVDPRG1G?aSTVFQRXSYNGQSSVVRC^ 
LGHPII^PIYNSVAWGPSRGRGGYTPKTNEELI^ 
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Predicted end 
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Amino acid segment containing signal peptide 
(A=Alanine, C-Cye teine, D^Aspartic Acid, E~ 
Glutamic Acid, P= Phenylalanine, G=Glycine, 
H=Histidine, I^Ieoleucine, K=Lysine, 
L= Leucine, M=Methionine, N=Asparagine , 
P^Proline, Q=Glut amine, R=Arginine , 
S=Serine # T=Threonine f V=Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, *=Stop 
Codon, /-^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QSIiDVUQLCEGDI*SPGI,TDSTAP5SEriGKDDIiEELAAAA\QKME 
KVABAAPQEI*DTIAIJVSEi«VETDVMNQ\RQT\TLCRVPAGATG 
SLAPRPCDVPTCPTL 


6007 


3 


2351 


HEIjGQVEYVFTDKTGTLTENEMQPRECS ingmkyqbingrlvpe 
GPTPDSSBGNLSYLSSLSlILNNIiSHLTTSSSFRTSPBNETEblK 
EIlDLFPKAVSLCHTVQINNVQTDCTGI^PWQSNTAPSOLErfM 
SPDEKALVEAAARIGIVFIGNSEETMEVKTIiGiaiERYKU>HILE 
FDSDRRRMSVIVQAPSGEKLIiFAKGAESS ILPKCIGGE IEKTRI 
HVDEPAL103IiRTLCIAYPJCFTSKF^re 

EKJ AAVFQPI EKDt»ILLGATAVEDRIX3DKVRKT I BALRMAGI KV 

QLARRlTBDHVIQHGLVVDGTCtiSI^^ 
LCCRMAPLQKAJCVT3^IiaSPEKPIT^ 

GIGIMGKEGRQAARNSDYAIARFKFLSKLIiFVHGHFYYIRIATIj 
VQYFFYKNVCFI TPQFLYQFYO^SCKyrijYDSVYLTIj Y \N I C FT 
SLPILX YSLIjBQHVDPHVliQNKPTI>YRDISKNRIiIfS IKTFI»YWT 
ILGFSHAFIPFFGSYI>LIGKmrSlXGNt3QMFGW 
VITVTVKMAI^rHFWTWliraLVTWGSIIFYFVFSLF 
LGSQNMYPVPIQLLSSGSAWFAI ILMWTCLFLDI IKKVFDRHTi 
HPTSTHCAOJ/rETNAGIKtt^ 
GRCSPTHISRSWSASDPFYTNDRSILTI*STMDSSTC 


600B 


4554 


1089 


AGVRRAGARRGPGRALPAGATAVp PFSARRRRRCPAPEHAGPAR 

ASRPSQF*TMFQLPVNNLGSLRKARKTVKKJ^ 

DFKQ FE PNDFYLKOTTWEDVGLWDPSLTKNQD YRTKPFCCS ACP 

FSSKFFSAYKSHFRNVHSKDFKNR I LLNCPYCTFNADKKTLETH 

IKJEFHAPNASAPSSSLSTFia3KNKNIX5LKPKOADSVEQAVYYCK 

KCTYRDPLYE XVRKHI YREHFQHVAAPYIAKAGEKS LNGAVPLG 

SNAREESS IHCKRCLFMPKSYKALVQHVIEDHERIGYQVTAMIG 

HTNVVTORSKPLMLIAPKPQDKKSMGLPPRIGSLASGNV\RSLP 

SQCJWNRI^IPKPNIiNSTGTOM>4SSVHIj^^ 

GQSMRLGLGGNAPVS IPQQSQSVKQLLPSGNGRSYGLGSEQRSQ 

APARYSLQSANASSLSSGQLKSPSLSQSQASRVLGQSSSKPAAA 

ATGPPPGNTSSTQKWKICT ICNKLFPENVYSVHFEKEHKAEKVP 

AVANYIMKIHNFTS KC^jYCNRYLPTDTLLNHML IHG LS CPYCRS 

TFNDVEKMAAHMRMVHIDEBMGPKTDSTIjSFDLTLQQGSHTNIH 

I^VTTYNLRDAPAESVAYHAQNNPPVPPKPQPKVQBKADIPVKS 

SPQAAVPYKXDVGKTI^LCFSILKGPISDALAHHI»RFJffl^ 

TVHP VEKKLTYKC I HCLGVYTSN^r^7^T ITLHLVHCRGVGKTON 

GQDKINAPSRI^QSPSIJVPViraTYEX^F 

SFFF^KPEEPVVIJu^PKGH\EDDSYEARKSFLTKYFT\KQPYP 

TRREISKIiAASIW\WK\SDIASHFSNKRKKC^DCEKY^ 

LGFNMKELNKVKHEMDFD^GLFENHDEKDS R VNAS KTADKKJLN 

LGKEDDSS5DSFBNI£E3SNHSGSPFDPVFBVBPKISNI»TPBBfl 

VLKVIPEDASESFJ3HXDQKFJDGSKYT^ 

EVDQDDVVEKKI)GASPSF^GPGSO^VSDFEDNTCEMKPGTWSDE 
SSQSEDARSSKPAAKlGraT^KX?DREOIJCW^ 
QSQVRNASENDERLSNPQI EWQNSTIDS EDGEQFDNMTDG VAE P 
MHGSLAGVKLSSQQA 


6009 


4272 


1534 


CHGLQHLTP FRELNLSLQG*EPH*AA*QAVRSEEKS IC*GSPSC 
HLVtGVLVPVARO^SHSAGPAQSAFR*TGTGSGTPKAAEQSGYW 
FAYTLGHQHWNMFPIQRPPLVMKGRRIMGGKCKKG*VSDSVTQG 
RAVAGEQAS QRRTVFTAGGGECI^AKS VRA5 VFTGNQPGVMGLTj 
NGKRGGCFESGYLFGFIVIGKIQSLKAKVPIiPVNGQTGERASPG 
NCRni I VDAVC * SEHH *DHFIiAAAFLENS TI I S * VAPGS WQDHA 
VI^KF^QASVRCRGFBSViyrAPAGFWAHSPPGlX^F^TTTSVSL 
FVIAPQIX5RG VPFVEGQLVTVLGLVVPQS IRHTFVHHTQLFLHP 
I*I01iGALDVAFLHLLTLVC5SK 

AVTRGSAVQRRPS ITI SS IHVDTKIQQELHDVMVAGADGVVQWG 
DPFWGLAQI FHI*IDDPIjHOIELSFQRRV*EQCQGVKPDSQPVP 
RPLRVGLI^GPLVRGGGRRVAGRGKRC^mDLIiFPWRWGLSHRT 
t^LX/RGGDRGHVVVIVliaiLGSLVGGLGTDELLHFGC'K*I ,I I IG 
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corresponding 
to first 
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residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, (^Cysteine, D=Aspartic Acid, E*» 
Glutamic Acid, ^Phenylalanine, G=Glycine/ 
II=IIistidine, I~Isoleucine # K=Lysine, 
L=Leucine, M«Methionine , tf=Asparagine, 
Pt= Proline, Q=Glut amine, R^Arginine, 
S=Serine, T=*Threonine , V=Valine, 
WaTryptophan, Y- Tyrosine, X=Onknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\ -possible nucleotide insertion} 








1 * * RGRLSGBWGCGIX5RGELPQVSIGIGVS IVHIGQGDHEVDGG 
AGL VERGALHATG(X5VEAIjV(X}LIiDVGPAGAlX5LCDGAALF^ P 

grvgqlpahglqvcitlvaqwrmhdgretjggakwpwqalhgaai 
cgvggaj:llkai.suyfi j kgg*rlwcargq*pvkkrqrrwrg*tr 

R*NGLTIHCFN*LI *GAVCC^VILRWCGLLBVHGVYGT* IHCL 
GS FPGRLWP * PFI SQERPNGHCQWB FRLAVPS WKCRWSRWRVRG 
TWRYGNPIiLNLL*GAWLGGAA0GGQQGGPLSTWQACTGPGOA^ 
IiPPFQGACRPRTQRCRTWCPIAWRQLLAYTRD 


6010 


1 


3533 


IMPCGSSRLIiRGCWTHPNEPVSDLSYFIX:iESVM^SKVLGKSM 

AGISQNAKTGDLPAJreECVGIASKAbCGLTKAAAQAAyi»VGIFD 

PNSQAG3IQGLVDPIQFARANQAIQMACQNLVDPG5SPSQVLSAA 

TIVAKHTSAGCNACR I ASS KTAN PVAKRHFVQSAKKVANSTANL 

VKTI KAIJX3DFSEDNRNKCR1ATAPL IEAVENI.TAPASNPEFVS 

IPAQISSBGSQAQEPlLVSAKPMLESSSYIiIRTARSLAIlfPKDP 

PTWSVLAGHSH7VSDSIKSLITS IRDKAPGQRECDYSIDGINRC 

IRDIBOASLAAVSQSLATRDDXSVEALO^QLTS VVQEI GHL IDP 

I ATAARGBAAOIiGHKGTQIiAS YFB Pl» I LAAVGVASKIT J0HQQQM 

TVLDQTK7ITJVBSALQHL YAAKEGGGNPKA^ 

RAVDD I MVTT tNEAASEVG^VGGWVDA^ AEAMS KLDEGTPPE P KG 

TFVDYO/ITWKYSKAIAVTAQEMMTICSVTNPBEIX^I^Q^TSD 

YGHLAFQGGMAAATAEPBEIGEX3IRTRVQDLGHGCIFI>VQKAG\ 

ALQVCPTDSYTKRKLIBCARAVTBKVSL^^ 

TAATAVSGI IADLDTTI M PATAQTIiMAENS BTFADHREN ILKTA 

KALVEDTKIjTjVSGAAST PDKLAOAAQS SAATITQLAEWKLGAA 

SLGSDDPETQWLINAI KDVAKALSDL ISATKGAASKPVDDPSM 

YOIiKGAAKVMVTtlVTSLIiiCr^ 

ELTVFQSKDVPEKTSS P RES IRMTKG I TMATAKAVAAGNS CRQE 
DVIATANLSR KAVSDMLTACKQAS FHPDVSDE VRTRALRFGTEC 
TLGYLDLLEHVLVT r^KPTPELKO^LAAFSKRVAGAVTEI»I QAA 
EAMKGTEWVDPBDPTVXAETELLGAAAS I EAAAKKIiEQLKPRAX 
PKQADETLDPEBQILEAAKS I AAATSALVKSAS AAQRELVAQGK 
VGS IPAKAADDGQWS OGIil SAARMVAAATSSLCEAANASVQGHA 
SRKKTaSSAKQVAASTA^IJjVACKVKAIX^r^BAMRRJjQAAGNAV 
KRASDNLVRAAQKAAJTGKADDDD VVV KT KFVGG I AQI IAAQEEM 
LKKERELBBARKJOAQIRQQQYKPLPTELRBDBG 


6011 


446 


1835 


LLQPAMRKSPGLSDCIjWAW I KLLSTLTGR S YGQPSLQDKLKDNT 
TVFTRILDRLLDGYDNRLRPGIX3ERWEVKTDIFVTS PGPVSDH 
DMBYTIDVFFRQSWKDBOUJCFKGPM^^ 
FHNGKKSVAIINMTMPNKLIiRITSDGTI^ 

GRDFPM\D\AHACPIJCFGSYAYTRABVVYBW1REPARSVVVAED 
GSRLNQYDLLGG/IVDSGXVOSSTGEYV^ 

QTYLPCXMTVI I>SQVS FWLNRESVPARTVFGVTTVLTMTTI>S IS 
ARNSLPKVAYATAMDW FIAVCYAFVFSAL I EFATVNYFTKRGYA 
WDGKSVVPEKPKXVKDPLIKXNNTY^ 

T1AKSAT I EPKE VKPETKPPEP KKTFNS VS KIDRLS R IAFPLLP 
G I FNLVYWATYLNRB PQLKAPTPHQ 


6012 


351 


5013 


PAELFG^FAIWHKELYDWRLGP?WQCOPVISKSIiKKPLECIKGB 

EGIQVRBIACIQKDKDIPAEDirCEYFEPKPLLEQACLIPCQQD 

CIVSEFSAWSECSKTCGSGI^HRTWIVVAPPQFGGSGCPNLTEF 

QVCQSSPCEAEFXRYSLHVGPWSTCSMPHSRQVRQARRRGKNKE 

REKI)RSKGVKDPEARELIKKKRNRNRQNRQENKYWDIQIGYQTR 

EVMCINKTGKAADLSFCQQEKLPMTFQSCVITKECQVSEW3EWS 

PCSKTCHDMVSPAGTRVRTRTIRQFPIGSEKECPE FEEKE PCLS 

QGDGWPCATYGWRTTEWTECRVD PLLSQQDKRJRGNQTALCGGG 

IO/TREVYC^ANElftJiSOJjSTHKNKKASKPW 

QIXMIPCPTECBVSPWSAWGPCTYENaroQOGKKGF^ 

NEim^SGVTGNCPHLLSAIPCEEPACYDWKAVRI^CEPDNGK 

KCGPGTOVQEVVCINSDGEEVDRQLCRDAIFPIPVACDAPCPKD 

CVI^TWSTWSSC^TCSGKTTEGKQIRARSILAYAGEEGGIRCP 

NSSALQBVR5CNEHPCTVYHWQTGPWW 

NGEASCSVGMQTRKVICVRVNVGQVGPK^OT 
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flnu.no acid segment containing signal peptide""" 
(A- Alanine, Cysteine, D^Asparhic Acid, B= 
Glutamic Acid, ^Phenylalanine. G^Glycine, 
H»Histidine, I=»Isoleucine, K=Lysine, 
L^Leucine, M=Methionine, tfcAsparagine, 
P« Proline, Q=6lutamine, R=Arginine, 
St=Serine, T=Threonine , V= Valine, 
W=Tryptophan, Y=Tyrosine, xwonknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PCKKDCI VTPYSDWTSCPS \SCKB6DSS IRKQSRHRVT IQLPAN 
GGRDCTDPLYBBKACEAPQACQSYRW\KTHKW\HRCQ\LVP\WS 
VQQDS P\GAQEGCGPGRQARAI TCRKQDGGQAG1 HBCLQYAGPV 
P ALTQACQ I P C0DDCQLT3 WS KFSS CHGIXX5AVRTRKRTLVG KS 
KKKKKCKNSHL Y PL I KTQ YCPCDKYNAQP VGIJWSDC1L PEGKVK 

vlix3mkvqgdikecgqgyliyqamac™qngri.vbtsrcnshgyi 
eraciipcpsix:ki^ewsnwsrcskscgsgvxvrskwlrbkpym 

C<3RPCPKIJ)HVNQA<2VYinm>CHSIX^QYLHVTEPWSIC^ 

NMRENCGEGVCmiKVRCWJNTADGPSEHVBDYI^PBBMPLGSR 

VCIO»PCPBDCVISEWGPWTQCVLPCNQSSFRQRSADPIRQPADB 

GRSCPNAVBKEPClflliNKNCYHYDYNVTDWSTCQI^EKAVa^I 

KTRMLIXIVRSIXJKSVDLKYCEALG 

LSDWS PWSECSQIXTGLTGKMIRRRTVTQPFQGDGRPCPSLMDQS 
KPCPVKPCYRWQYGQWSPCK2VQKAQCGKGTRTI^SCVVSrX3SA 
DDFSKWDBEFCADIBLI IDGMKNMVLBBSCSQPCPGDCYLKDW 
SSWSLCQLTCVNGEDLGFGGIQVRSRPVI IQELBHQHT.CPBQML 
F/TKSCYtXSQCYBYKWMASAWKGSSRTVWCXiRSDGINV^ 
SQPDADRSCNPPCSQPHS YCSBTKTCHCBEGYTBVMSSNS TLKQ 
CTLI PVVVLPTMEDKRGDVKTSRAVHPTQPS SNPAGRGRTW FLQ 
PPGPIX5Rl»KTWVYGVAAGAFVIiLIFrVSMIYIJVCKKPKKPQRRQ 
NNRLKPLTLAYDGDADM 


6013 


1161 


71 0 


GAFIAGVPVQ PVTilRYPNS LDTTSWAWRG PGVLKVLWLTASQP C 
S IVDVBFLPVYHPSPE ESRDPTLYANNVQRVMAQAIiGl PATECB 
PVGSLPVIWGRIiKVALEPQrj /WGTGKSASBGWAVRKLCGRWGR 
ARPBSNDQPGRVCOAATAI* 


6014 


2857 


613 


EAVAGGMBKSRM^^I^KGPDTIX:PBKDEFMK£DFDVDHFVSDCRK 

RVQLEELRDDLKLYYKW,KTAMVKLINKDYADF\VNLSTWLV3M 

DKAIJ^LSVPLGQLREBVLSLRSSVSEGIRAVDBRMSKQEDIRK 

KKMC^I^IQV1RSVEKIEKILNSQSSICETSALEASSPI.LTGQI 

LERIATBFKQI^FimC^SK\GMPia^KVRPR^ITAMLQQSLB 

QLLLEG LQTSDVD I IRHCLRTYAT I DKTRDARMVVG0VTjVKPY I 

DEVI IEQFVESHPNGLQVMYNKLLEFVPHHCRJbLR 

B3CGNTVT>GYDFTjVNSVWPQrvgGI^ 

TISMDFVRRLERQCBSQASVKRIjyiHPAYHSEKKJC^ 

R FH K T AGS LR AALTDVLEDAPABS P YCLLASHRTWSSLRRCWSD 

E^LPLIWmU,WRI^GRFWAEYSVFV\N\ELSI,RPISNESPKE 

XKKPLVTGSKEPSITQGNTBDQGSGPSETKPVVSrSRTQl,VYVV 

ADIJ^KLQF^T.PETXRIlKPKIJ^IGPK^rPSSISAALEDSOSSPS 

ACVPS1>SS KI I QPLSDS CFX3FLKSAIaEVPRLy RRTNKBVPTTAS 

SYVDSALKPLPQLQSGHKDKLKQAI IQQWLEGTLSESTHKYYET 

VSDVI^SVKJCMEESLKRI^KXIARKTTPANPVGPSGG^ 

QIjALD VE YLGEQ I QKLGLQASDI KS FSALAELVAAAKDQATAEQ 

P 


6015 


13 


2237 


AEGCAERROTEPVVBiSMSWBSGAGPGIiGSQGMDLVWSAWYGKC 
VKGKGS^PLSAHG IVVAWLSRAEWDQVTVYLPCDDHKXQRYALN 
RITVWRSRSGNELPIAVASTADLIRCKIiLDVTGGLGTO 
GMALVRFVNL 1 3 ERKTKFAXVPLKCLAQEVNI PDHI VDLRHEIiT 
HKKMPHINIX^GCYFVLDWLQKT^ 

EGIBEEt^EEDKWIVVDDITEQKPEPQDDGKSTESDVEADGDSK 
GSEEVDSHCKKAl^HKELYFJlARELLVSYEEEQFWIJBKPRYIjP 
KAIKAWNNPSPRVECVLAELKX^/TCSNRE^ 

17ITOT R AT_/^T'CVT7T?XnmT WT1XJT "I I HIT DP Cf\ *BtM\ftT T OPT DOOMtJTVA 

ALLER^SELPALGISGIRPTYIIJWTVEL 
SAGQWBARRt^TjPNCSA^LDWPRMVESCnjGSPCWASPQLLRI I 
F\KAMGG^LQDE\EQEKXLRI(^IYTQSGENSLVQEGSEASPIG 
KSPYTLDSLYWS VKPASSS FGSEAKAQQQKKQGSVNDVKEEEKE 


i 






EKEVLPDQVEEEEKNDDQEEEEEDEDDEDDEEEDRMEVGPFSTG 
QESPTAENARI*LAQK&GAIXX»£^ 

SRPRTPAE LMLENYDTHVI FWTKPVL\EQRLE PS TCK\TDTLG L 
\SCGVGS\GKCSNSS5S^RGAF1iLBARG^\GL\K^LOXP 1 


6016- 


13 


2237 


asgcaerj^tepvvei^mswesgagpgi^sqwdlvwsawygkc 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
correspondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, O Cysteine, D=Aspartic Acid, E» 
Glutamic Acid, F=Phenylalanine, G=>Glycine, 
H-Histidine, I »Iso leucine, K~ Lysine, 
L^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q^Glutaraine, R=Arginine, 
S=Serine, T=*Threonine, V»Valine, 
W=Tryptopnan, Y=*Tyrosine, X=tTnknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ViO^KGSLPLSAHGIWAWI^RAKt^VTVYLPCDDHKLQRYAI^ 

RITVWRSRSGIffiLPIAVASTADLIRCKLLDVTGGLGTDEIJlLLY' 

GMALVRPVNLISERKTKFAKVPLKCLAQEVNIPDWIVDLRHKLT 

HKKMPKINDCRRGOTFVTJ^WLQKTYWCRQLK^ 

BGIEBBDQEEDKNIWDDITEQKPEPQDDGKSTBSDVKADGDSK 

GSEBVDSHCKKaLSHKELYBRAREJ^VSYEfiEQFTVLEKPRYLP 

KAIKAKNWPSPRVEO/LAELICGVTCENRE^^ 

FBQIiAALQIEYEENVDWTOVLVPK^ 

ALLERMLSELPAIXnSGIPJPTYILRWlTO^ 

S AGQWEARRGWRLPNCfi ASLDW P RMVK5CLGS PCWAS PQLLR 1 1 

F\KAMGQGLQDE\BQEKLIjRICSIYTQSGE»SLV0BGSEASP1G 

KSPYTl^SLYKSVKPA5S5FGSEAKACCX)EEQGSVNrnnCEEE^ 

BKEVIiPDQVKKEEBKDDQEEBEEDKDDBDDEBKDRMEVGPPSTG 

QESPTABNARLLAQKRGALQGS AWQVSSEBVRWDT FP\ LGRM PR 

SRPRT PAELMLKNYDTHVI F WTKPVL\EQRLEPSTCK\TDTLGL 

\SOGVGS \GNCSNSSSSNPRGAPLLEARGSLH\GL\KTGIiQLF 


6017 


203 


3469 


SHQEIEQNSAMAPRKRGGRGISFIFCCFRNfJDHPKITYRLRNDS 

NFALQTMB PALPMP P VEEIiDVM F5ELVDELDLTDKHREAM FALP 

ABKKWQIY CS KKKDQE ENKGATS WPEFYIDQLKS MAARKS LLAL 

EKBEBBERSKTI BSLKTALRTKPHRFVTRFI DLDGLSCI LNFLK 

TMDYBTSBSRIHXSIalGCIKAI^NNSQGRAHVLAHSBS INVIAQ 

SLSTENIKTJWAVLEILGAVCLVPGGHKiCVLQAHLHYQK^ 

rRFO^INBI^XSTGRYRDKVSLKTAI^FlNAVLSO^^VESL 

DFRLHLRYE \ FLMLG III P VMDKLRKHE^TIJ>RHLDFFEMLRNE 

DELE FAKRPELVHIDTKS ATQK PBLTRKRLTHS EAYPHFMS ILH 

HCXQMP YKRS GNTVQY WLLLDR 1 1 QQI VIQNDKGGDPDSrPL EN 

FNIKNWRMLVNENEVKQWKiSQAEKM^ 

DAKTQEKEEMMQTLNKHKEKLEKETTEH^ 

LSRRAVCAS I PGGPSPGAPGGPFPSSVPGSLLPPPPPPPLPGGM 

LPPPPPPLPPGGPPPPPGPPPt/JAIMPPPGAPMGLALKKKSIPQ 

PTNALKSFlWSKLPEinCLEGTWWEIDDTKVPKIIiDIiEDLERTF 

SAYQRQQDFFVWSNS KQKEAI^IDDTI^SKLKVKBLSVI DGRRA 

QNCNIU£RLKI^aiDBIKRAILTMDEQ2D^^ 

KSrnDLLKEHKHELDRMAKADRFLFKMSRX^QQRLQSLYFKK 

K^AERVAEVKPKVSAIRiJGS^WR.SGAJ^KOLLEVVrAPXJWYMN 

KGQRG»AYGFKlSSIOTIAiyrKSSIDKNITLLff5(LITIVENKYP 

SVIiNI^SI^DIPQAAKVN^ELDKEISTLRSGLiCAVETELEYQ 

KSQPPQPGDKFVSVVSQFTTVASFSFSDVEBLLAEAKDLFTKAV 

KHFGEEAGKTQPDEFFGIFDQFI/^VSEAKQENENMRKKKEEBE 

RRARMEAQIJCEQRERERKMRKAKENSKESGEFDDLVSALPvSG^ 

FDKDLSKLKRNRKRITNQMTDSSRERPITKXNF 


6018 


13 


2510 


TISQSGGIRRRRPJWWFEVVNMDFSRUI^SPPQC^FJSTGYTY 

ALSSSYSSDALDP^TBHKLDPVF1)SPRMSRRSI»RLATTACTLGD 

GEAVGADSGTSSAVSLKNRAARTTKQRRS TNKSAFS INHVSRQV 

TSSGVSYGGTVSrX}DA\rTRRPPVLDESWIREQTT\n>HFWGLDDD 

GDLKGGNKAAIQGNGDVGAGAAIX?HNGFFCSNaqML^ 

AIIPAAPGPVSRVYSRDRNQKCDDCKGKRHLDAHPGRAGTLWHIN 

ACAG YFLLQILRR IGAVGOAVS RTAWSALWLAWAPGKAASGVF 

IWI^IGWYQPVTLISWIiNVFLLTRC^^ 

IiSLRGQG\NFFSFLPVLNWASMHRTO^^ 

PLQGD SEAFPWHWMSGVEQQ VASLSGQ CHHHGENLRELTTLLQK 

LQARVIX5MEGGPJU5PSASVRnAVGQPPRETDFMAPHQEHBVRMS 

KLEDIJ^KLREKSEAIOKEI^OTKQKTISAVGEQLLPTVEm^L 

EI^LKSELSSWPJlVia^CETVDAVQERVDVQVREMVKLLFSED 

a^SLEQIiQRFSSQP^KGDLQTMI^ 

TKQLPTS EAVVSAVSEAGASGI "KAQARAI VNSALKLYSQDKTG 

MVDFALB5GGGSII 1 STRCSFn^ETin'AIiMSI^IPLWYFSa < 5PR 

WIQPDI YPGNCWAFKGSQGYLWRLSMM IHPAAFTLEH I PKTL 

S PTGNIS SAPKDFA VYfil.PNEYQKEGQLliGQ PTYDQDGH S LQMF 

QALKRPDDTAFQIVELRIFSNWGHPEYTCLYRFRVHGEPVK 


6019 


2 


1066 


T PNDREP P PQRPPS SRRASHLAQB I TSAAS LGDQTQI LGSLTTA 
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SE0 
ID 
NO; 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H^Hietidine, I-=Ieoleucine, K=Lysine, 
L=Leucine, M-Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V-^ Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *«stop 
Codon, /^possible nucleotide deletion, 
V^possible nucleotide insertion) 








PVITSAIRSMPGISSQILTNAQGQVIGTJaPWWNSASVAAPAPA 
QSI^VQAVTPQLIJJiAQGQVIATIiASSPLPPPVAVRK\PSTPES 
LLKSBVQP I KPTPTVPQPAWIAS PAPAAKPSASAPI PITCSET 
PTVSQLVSKPHTPSLDBBGINLEEIRBFAKNra^ 
QVGQALTATEG P AYSOS AI CRFE KLDI TPKSAQKLKP VLEKWLN 
EAEI^NQEGQONIiflKFVGGEPSKKRKRRTSFTPQAIKAI^ 
KNPLPTGQB II^IAKELNYDREVVRVWFCNHRQTr,KlJTS KLNVF 
QIP 


6020 


4953 


549 


mQFBVSIGNYGNKFDTTCKPIiASTrQYSRAVFDGNYYXYLPW 

AHTKPVVTLTSYWBDISHPJLBAVN^ 

QGKIPANQIiAEIiWI»KLIDBVIEimOT 

RKIiRSRSLSQrHEAAVRMI«KATDVKSTTJ^rBDWIJ)BXr^ra 
EPQN3KPDI I IVMIPjGEKRLAyARIPAnQVLY STSGENASGKYC 
GKTQTIPr^PQBKNNGPKVPVBLRVNIWIXjriSAVBKlCFlJSFAB 
GTPTVFAP^YKNQALMFGKWGTSGLVGRHKFSDVTGKI KkKREF 
FIiP?KGWEWE£EWIVDPBRSIiLTEAnAG^^ 
GGDWKPABDTYTDAmDKAASPSKLTCPPGWEWKDDAWSYDINR 
AVDBKGWEYGITIPPDHKPKSWVAAEK^ 
IiTQTASSTAGAMEBI^DQEGNKYASIjIGWKFHWKXJRSSDTPRRR 
RWRRKMAPSETHGAAAl FlOiEGAIiGADTTEDGDKKSLEKQKHSA 
TTVPGAiraPIVSCKPDSDYIYHIJlCYVYQARin^I^^ 
YAHICFLHRSXTTEI IHSrUSPTWIXOTIFI)BVElYGBP<yrVLQ 
NPPKVIMELPTJNDQVGKDEFLGRSIFSPVVKLNSEMDITPKLU* 
HPVMNGDKACGDVLVTAEIjIIiRGKPGSNLP I LP PQRAPNL YMVP 
QG I R P VVQLTAI PIIiAWGLRNMKNFQMAS I TS P S LWECGGERV 
BSVVXKNLKKTPWPSSVLPMKVFLPKESLWPPLVIKVIDHRQ 
FGRKPWGQCTI BRl^RFRQ^P YJ\OKEDIVPQ1jKASIiIiSAPPC31 
D IV I EM EDTKPLIiAS KCLS SMSTALS XMAS PATVHLT E KBEE IV 
DVWSKFYASSGEHKKr^QYlQKGYSKLKXYNCELBNVAEFEGLT 
DFSDTFKLYRGKSDHNEBPSWGBFKGSFRIYPLPDDPSVPAPP 
RQFRELPDSVPQBCTVRIYIVRGLFJiQPQDNNGIjCDPYI kitlg 

TFTRDEKVGBTIIDIjENPFXLSRFGNSHCEXIPEEYCVSGVIJTW 
PJ)SLR\PTQ\l«l^VARFKGFPQPIl^BDGSRIRYtX^ 

feankii^hlgapbbrlalhij^tqglvpehvktrti^ 

NIS\RYY1jRVIIWNTKDVILDBJKS1TGEB^DIYVKGWIP<5NBE 

NKQKTDVHYRSIiDGEmFNWRFVPPFDYliPAEQLCIVAKKE^ 

SITXyTEFRlPPR\LIIQI>l\DNDKFS\LDOTI^FPRTLTCRHTI 

HFLQKSPGGNC /RGIJW I PDkKAM^ 

PCYAEKIX3ARVMAGKVE>rrLEILNBKEADERPAG 

KLDLPNRPETSFIjWFTP^KTMKFIVHRRFKWVI IGLLFIilLL 

LPVAVIJLYSLPKYLSMKIVKPW 


6021 


4953 


549 


EAIQFWSIGNYGWKFDTTCKPrJ^TTQYSRAVFIX^NYYYYXPW 

AZ-mCPVVTLTSYWEDISHRimVim^^ 

QGKI PANQLAELWLKLIDEVI BryTRYTLPLTEGKANVTIVTjDTQ I 

imiRSP^LSQIHEAAVPJraSFATDVKSTI^ 

EPQNSMPDI I IWMIRGEKRLAYARI PAHQVLYSTSGBNASGKYC 

GKTQTIFLKYPQEKNNGPKVPVBLRVNI 

GTFTVFAEMYBKQ ALMFGKWGTSGDVGRH^ F 
FlaPPKGWEVn^P^/IVDPERSLLTEADAGm'EFiXJEVYQNESRYP 
GGDWKPAEIXrYTDANGDKAASPSELTCPPGWEWEDnAWSYDINR 
AVDEKGWEYGITIPPDHKPKSWVAAEKMYHTHRRRRLVRKRK^ 
LIXOTSSTAGAMEELQDQBGWBYASLIGWKFIiWKQRSSDTFRRR 
RWRRKMAPSETHGAAAl FKLBGAIiGADTTEDGDEKSLEKQKHSA 
TTVFGANTPIVSCNFI>RDYIYHIjRCY^ 

YAHI CFItHRS KTTB I IHSTLNP 7WD0TI I FDBVE I YGEPQTVLQ 

NPPKVIMBLFDNDQVGKDKFLGRS I FSFVVKLNSEMDITPKLLW 

HPV^GBKACGI^VTABLIIJlGKrXSSNLPILPPCjU^^YM^ 

QGlRP^LTAIEIIJuTCUaiMKNFQM^ 

BSWIKNLKKTPNFPSSVI.FMKVFLPKEBI.YMPPLVIKVIDHRQ 

FGRXPVVGQCTIEPJLDRFRCDPYAGKEDIVPQLKA^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine , D^Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I^lsoleucine, K=Lysine, 
I*=Leucine, M«nethionine, N=Asparagine f 
P~ Proline , Q=Glutamine , R^Arginine , 
S=Serine, ^Threonine, V^Valine, 
W-Tryptophan, Y-Tyrosine, X^Dnknown, +=3top 
Codon, /^possible nucleotide deletion, 
\=poeoil>le nucleotide insertion) 








DIVIHraDTKPLLAS^^ 

dwwskfyassgkhbkcgqyiokgysklkiyncklenvaejeglt 

DFSDTFKLYRGKSDENEDPSWGEFKGSFRIYPIiPDDPSVPAPP 

RQPRELPDSVPQECTVRI YIVRGLELQPQDNNGLCDP YI KITLG 

KKVTE \DRDHYIPNTLNPVFuRMYfcIjSCYL 

TFTRDEKVGETI IDLENPF\JjSRFG\SHCG\ I PEBYCVSGVNTW 

RDSLR\PTQ\LU^AREKGFPQPILSE1X3SRIRYGGRDYSI 1 DK 

PEANKILHQHI^APBERIAI^IIiRTQGJLVPBi^ 

NIS\RYYI>EVIIWKTKDVIIiDBKSlTGEBMSDIYVKGWIPGNBK 

l^QKTDVHYRSI^KGWFl^FWPFDYLPAEQLCIVAKKEHPW 

SII^TBFRIPPR\LIIQIW\im>KFS\IJ3DYLGFPRTI,TCRHTI 

HPLQKSPGGNC/RGLDMIPDLKAMNPLKAKTASLFEQKSMKGWW 

PCYAEKTX^VMAGKVEtTTLBIl^KEADERPAGKGRDEPNMNP 

KIiDLPNRPETSFbWPTNPCK^l^PlVWRREK?7VIIGl;LFI.riILL 

LFVAVJjLYSIiPNYLSMKIVKPKV 


6022 


4*53 

- 


549 ~ 

• 


^IQFEVSIG^GNK^irTTCKPIiAS TTQYSRAV F1XIN YYYYLPW 

AHTKPVVTLTSYWBDISHRU)AVMTLIkAMAERIiQ^ 

QGKI PANQIJVELV^LlDE\aEiyrRYTLPLTBGIOUmVLDTQI 

RKIJiSRSLSQIHKARVPJ^SEATDVKSTl^ElEDWLDKLMQLTE 

EPQNSMPD 1 1 IWMIRGEKRLAYAR J PAHQVL YSTSGBNA3GKYC 

GXTQTIPIiKYPQEK^GPKVPVBIJlVNIVTOI^WKKPNSPAE 

GTFTVFAEMYENQALMFGKWGTSGLVGRIOC^ 

FLP PKGWEWBGEN IVDPERSLLTEADAGHTEFTDEVYQNESRYP 

GGDWKPAEDTYTDAITCDKAASPSELTCPPGWEWEDDAWSYDINR 

AVDEKGWEYGIT X P PDHKPKSWVAAEKMYBTHRRRRIjVPJKRKKD 

LTO^ASSTAGAMEELQDQEGWEYASLIGWKFIIWKQRSSDTFRRR 

RWRRBMAPSETHGAAAI FKLEGAI^aADTTEDGDEKBIiBKQKHSA 

TTVEGAOTPJVSCNFDRDYIYHI^CYVYQARNLIALDKBSFSD 

YAHI CPLHRSKTTEI IESTLNPTWDQTI I FDEVE IYGEPQTVIjQ 

NPPKVlMEIiFDNDQVGKDEPLGRSI FSPWKLNSEMDITPKIjLW 

HPVMNGDKACGDVLVTAELILRGKDGSNLPILPPQRAraLYMVP 

QGIRPWQLTAI EILAWGLRNMKNFQMAS ITSPSLWECGGERV 

ESWIKNIiKXTPNFPS^iFMKVFI^^ 

FGRKPVVGQCTIBRIiDPJ?RCl>PYAGKEDI VPQLKAS LLSAPPCR 
DIVlEMEDTKPUASKCI>SSHSrALSKMASPATVHT,TEKEEEIV 
DWWSKFYASSGEHEKCGQYIQKG YSKLKI YNCEIJBNVAEPEGIjT 
DFSIJrFKLYRGKSDSNEDPSVVGBFKGSFRIYPLPDDPSVPAPP 
RQPRBLPD3VPQBCTVRIYIVRGIjEIjQPQDWNGI»CDPYTKITIX3 
KtCVIE\DRI>HYIPjnT^PVFGRMYEliSCYI>PCjEKDLKISVYDYD 
TFTRDEKVGETI IDLENPF\LSRFG\SHCG\IPEEYCVSGVNTW 
RDS hR \ PTQ\UiQNVARFKGPPQP 1LS EDGSRIR YGGRDYSLDE 
FEANKILHQHI^APBERIJO^IIiRWI.VPEHVBTRTiaSTPQP 
NIS\RYYIJiVIlWNTKDVIIJ>EKEITGBEMSDr5WKGWlPGNEE 
NKQKTDVHYl^LDGEGNFl^FVFPFI} 

SIDQTEFRIPPR\LIIQIW\DNDKPS\IJ>DyiX?FPRTI»TCRHTI 
HFLQKSPGGNC /RGLDM1 PDLKAMHPLKAKTAS kFEQKSMKGW W 
PCYAEKDGARWJ^KVB^rrLEII^KEADBRPAGKGRDEPI^P 
Kl^fcPimPETSFLWFTNPCKTMKFIVWRRFK^ 
LFVAVLLYSLPNYLSMKIVKPNV 


6023 


102 


916 


SQEI^FVKI^JNU^PDRAE<K3KI*TL^ 

S FYLKANCKVCFVALI QS FSHYS IVGQKLGVSLTMARERGQLVF 

LEGIi/ IVC SGR\VFOAQKEPHP LQFLREANAGNIJCPIiFE FVREA 

LKFVDSGEARWTYPVLIiVDDIiSVIJ^IC^ 

VCSHSIjKGNMVVLVHDSGDABD^ 

ATGFCRDVHGQLRI LWRRPSQPAVHRDQSFTYQYKIQDKSVSFF 
AKGMSPAVI* 


6024 


3 


3260 


FI>SFI>CYPRFR(^FCLQFAIPASRMBQLNELELLMEKSFWKEAE 
LPAEtiFQKKVVAS^RTVLSTGHDNRYl^ 
RLVITASOSX*BNKELCIIjRNDWCSVPVEPGDI ihlegdctsdtw 
I IDKDFGYLILYPDMIjI sgtsiassircmrravi»setfrssdpa 

TRQMLIGTVLHEVpQKAJNNSFAPEK 
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SEQ 
ID 
MO: 


Predicted 
beginning 
nucleotide 
location ' 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
tA= Alanine, C= Cyst cine, D-Aspartic Acid, E- 
Glutaraic Acid, F=Phenylalanine, G=Glycine, 
H=IIistidine, I=Isoleucine, K^=Lysine, 
L-Leucine, M-Methionine, N^Asparagine , 
P= Proline, Q=Glut amine, R=Arginine, 
S-Serine, T=Threonine, V»Valine, 
W^Tryp t ophan , Y=Tyrosine, X-Unknown , *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








YRLNIiSQDEIKQEVEDYLPSFCKWAGDFMHKNTSTDFPQMQLSL 
PSDNSKDNSTCNIEWKPMDIEES IWSPRFGLKGKIDVTVGVKI 
HRGYICIKYK I MPIiKLKTGKESNS I EHRSQWIiYTLLSQERRADP 
KAGLbLYl»KTGQMYPV?ANHIX>KRELLKI^QMAPSLPHRISKB 
ATRQKTQLAS I»PQ I IEEEKTCKYCSQIGNCALYERAVEQQMDCS 
SVPIVMl^KIBEETQHIJCQTHLBYFSLWa^ 
HQNIWI^PASBMEKSGSCIGNIiIPJIEHVKIVaX^YbHNFQCKH 

PVTlttMAGDKVTVSGEEI^I»FALSRGY 
RNLS VIiPESTLPRLDQEEKNCDI OTPI/3NI^KIrflENTFVSKKLR 

VIiLSKDYTLIVGMPGTGKTTTICTLVPaLYACGPSVLLTSYT^ 

AVDNILLKLAKFKlGFTJElSfi\QIQKVHPAIQQPTEHHICRSKSl 

KS \LALIiEELYTSQLl DATTCMGXNHPIPSRKI FDFCI VDEASQ 

ISQPICLGPLFFSRRPVLVGDHQQLPPLVLNBBARAI^GMSESLF 

KRLB(JWKSAVVQLTVQYRMl^KIM3LSNKIiTYBGKI£ 

NAVINLRHFKI>VKtiBI.BFYADYSD^^ 

KVPAPEUVKKGGVaWTKAKT.1 VFLTSIFVKAGCSPSDIGI IAP 
YRQQLKI INDLIiARS IGMVBVNTVDKYQD\RDKS I VLVSFVRSN 
KDGTVGELIiKDWR RT iNVAT TRAKHKLI LLGCVPSLNCYPPLEKL 
LNHIiNSSKLIIDt.PSREHRSLCBILGDFQRE 


6025 


3977 


89 


GGFPAQSDHLPPVPPI^DI^TMSTLYVSPHPDAFPSLPJVIilA 
ARYGEAGEGPGWGGAHPRICLQPPPTSRTSPPPPRLPALBQGPG 
GLWVWGATAVAQUWAGIiGGPGGSRA^ 

AACGATLPAlX31>RSSAQDPQAVLGALGRAIiSPI^EWIiRLHTYliA 

(ffiAPTIADIAAVTAl^PFRYVLDPPARRIWNNVTRWFVTCVRQ 

PEFRAVIjGEVVIiYSGARPIjSHQPGPRAPALPKTAAQLKKEAKKR 

EKLEKFQQKQKIQQQQPPPGEKKPKPEKREKRDPGVITYDI.PTP 

PGBKKDVSGPMPDSYSPRYVKAAWYPWWEQQGFFKPEYGRPWVS 

AATCPRGVFMMCIPPPNVTOSLHLGHALTN^ 

TOWNPGCDHAGlATQVVVEKKLWREQGI^PaiQLGREAFLQEW 

XWKEEKGDRIYHQLKKI/3SSLDHDRACFTMDPR1«SAAVTBAPVR 

lilEEGI lYRSTPJtiV^SCTI^SAlSDIBVDKl^TGRTLIiSVP^ 

YKEKVE FGVLVS PAYKVQGSDSDEB VWATTR I KTMLGDVAVAV 

HPKDTRYQHLKGIWIHPFIiSRSLPIVFDEFVDMDFGTGAVKir 

PAHIKWYEVGX}RHGI^AISIMDSRaALINVPPPFLGLPRFEAR 

KAVLVAL KERGLFRG I EI^PMVVPLCNRSKDVVEPLLRPQWYVR 

CGEMAQAASAAVTRGDLRI IjPERHQRTWHAWMDNIRE \WCMFPG 

KLWWG\HR\IPAYFVTVSDPAVPPGEDPDGRYHV3GRNBABARE 

KAAKEFGVSPDKISI^DBDVLDTWP33GLFPI^ILGWPN0SED 

LSVFYPGTLLETGHBILFFWARMVMI^ 

IVPJ^GRJ04SKSLGNVIDPLDVIYGISIiQGI^KQLI^SNIJ>PS 
EVEKAKBGQKADFPAGIPBCGTD^FGW^YMSQGRDINLDW 
RlLGYimFC^LWNATKPAIJiGI^KGPVPSPTSQPGGHESLVDR 
WIRSRLT^VRLSN^PQflYDFPAVTTAQYSFOT.YKLCDVYLEC 
IjaWLWGVDQVAABCA^ 

RLPRRMPQAP PSLCVTP YPEPSECSWKDPEABAMjELALS ITRA 
VRP\LRADYNIiHPESGPTCFIjEVAD\EATCAIASAVSGYVQGPG 
QAQVWAVAEPWGLPAP\QGCAVALASDRCSI\HLQLQG\LLDP 
ARBLG\KLQ\AKRVEAQ\RQAQ\RLR\ERRA\ASGWPVKVPIj \e 
VQEADEAKUJQTEAKLRKVDRAIALFQKML 


6026 


2674 


514 


GP ITFUOQCAKMKDMPLRIHVIiIX3IAITTLVQAVDKKVDCPR^ 

TCEIRPWFTPRSlYMEASTVDCNDr/ILLTFPARIiPANTQII^ 

TtTOlAKIEYSTDFPVNLTGLDLSC?lNI£^^ 

YLEENKLTELPEKCLS EI^NLQE LYT NHNLLST I SPGAFIGLHN 

LIALHIJSISNPJ J QMINS KWFDALPNLEILM IGENPI I RI KDMNPK 

PLINLRSLVIAGINLTEIPDNALVGLBNLESISPYDNPJjIKVPH 

VALQKVVNLKFLDLNKNPIiraiRRGDFSNM^ 

ISlDSIAVDl^PDI^KIEATNNPPJ^YIIIPNAmiLPKLESIiML 

N5NAI>SALYK^TlBSLPNLKEISIHSNPIRCaX^IRWMNMNKTN 

IRFMEPDSIiFCVDPPEFQGOtWRQVHPRDMMEICliPLIAPESFP 

SNLNVBAGSYVSF1ICRATA\EPQPEIYWITPSGQKLLPNT\I>TD 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location, 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid, segment containing signal peptide 
(A=AIanine, C^Cysteine, D=*Aspartic Acid, B= 
Glutamic Acid, F-Phenyl alanine, G=Glycine. 
H^Hietidine, I=Isoleucine, K=Lysine, 
L=sl»cucin.e # Ms=Methionine, N=Asparagine, 
P=Proline, Q-Glut amine , R=Arginine, 
o>oci A uc/ l-mteomnc, vsvajLUic, 
W»Tryptophan, Y=Tyrosine, X^Unknown, *~Stop 
Codon, /=possible nucleotide deletion, 
V=possible nucleotide insertion) 








KFYVHSEOTLDI^VTPKEGGLYTCI ATNIiVGADLKSVMI KVDG 
SFPQDNNGSIOTKIRDIQANSVLVS5navSSKII.KSSVKWTAPVK 
1^SHAAQSARIPSDVKVYNX,THLNPSTBYKICIDIPTIYQKNR 
KKCVtrVTTKBLHPDQKEYBKHWTTTLl^ IGVI CLI S 
CbSPKMNCIX^HSYVRNYLQKPTFALGELYPPI>INLWEAGKKKS 


6027 


5254 


414 B 


GGRRAPGRPGRS IKDKEEBTVFREWSFSPDPI,PVRYYDKDTTK 
PISFYI^SLmOiAWKPRLEDGFlW^ 

IiLCHCMMGGYLDDRF IQGSVVCTrPYAFYHHQCIDVFVYFSHRTV 
T I PPVGVfn^TAHRHGVCVLGT FI 

QAVADRtiVQIT\RFFRFDGWIi r MIENS LSTiAAVGNMPPFLRYLT 
TQLHRQVPGGLVLWYDSVVQSGOLKWQDEI24QHNRVFFDSCDGF 
PTNYNKREE^naSRMI^AUKRRADVYVGVDVFARGNVVGaRFDT 
DKVGGGFRPRASGPVPPLGPHFLMDLPFPSAPQRNDSSCSSQSG 
DPVALRNRCPAPAKLCPH 


6028 


120 


3432 


NCI*LIiGAKGFHGEIEDLQQWI.TOT 

LNVHMEVCAAFEAKEETYKS LMQ KGQQNLARCPKSABTtJIDQD I 
N1^KEKWESVETKI.NRR\KT\K1^EALNIA\MBFIINSL\QDFI1J 
WLT0AF5QTr»NVASRPS LrLDTVLFQIDBHKVFANRVNSHRBQI I 
FJbDKTGTHLKYFSQKQDVVLIKI^ 

SLDDARKliAK^FHKAWSKLMBWl^ESEKSI^FJC>BIANDPDra K 

TQLAQHKEFQKSLGAKIIflVYDTTNRT^ 

DMI>S3IjRDKWl>riOTKSVERQNKIiEKA\l*I»FSG^ 

WLYRVEi>QIJVKDQPVHGDIDLVMNLIDiraKAFQKEIiGICRTSSVQ 

ALKRSARELIEGSRDDSSWVKVQMQELSTRWETVCALS ISKQTR 

LEAAliRQAEE FH SVVHAIjLEWIiAFAEQTLRFHG VLPDD EDALRT 

LIDQHKEFMKKLEEKRAELNKATTMGiyiVLAJ 

ITI IRARFEEVIJIWAKQHQQRU\SALAGLIAKQEI^^ 

ViAETTLTDKDKEVI PQE I EEVKALIAEHQTFMEEMTRKQPDVDK 

VTKTYKRRAADPSSLQSHI PVLDKGRAGRKRFPASSLYPSGSQT 

Q I BTKNPRVNLLVS KWQQ VWIiLTUJERRRKLNDAliDRIiEEIiREFA 

NFDFDIWRKRYMRWOTmKSRVMDFFRRrDKDQDGKlTRQEFID 

GIX^SKFPTSRLEMSAVADIFX)RI)GDGYTDYYEFVAAt>E^2«^ 

YXPITDADKI EDEVTRQVAKCKC AKRFQVRQIGI^fKYRFFLGNQ 

FGDSQQI.R LVRILRS TVM VRVGGG WMALDE FI>VKNDP CRA KGRT 

IWBLREKFXIJaxy^QGMAAFRPRGRRSRPSSRGASPmSTS^ 

SQAAQAASPQVPATTTPKI LHPLTRNYGKPWLTNS KM3TPCKAA 

BCSDFPVPSARGTPIQGSKLRLPGYl*SGRGFHSGBDSGIiITTAA 

ARVRTQPADSXKTPSRPGSRAGSKA6SRASSRRGSDASDFDISE 

IQSVCSDVETVPQTHRPTPRAGSRPSTAKPSKIPTPQRKSPASK 

LDKSSKR 


6029 


1 


3533 


IMPCX5SSRL*LRGCWTHF^PVSDI^YFDCIF^VMBNSKVLGESM 
AGISQNAKTGDLPAFGECVG I ASKALCGLTEAAAQAAYLVG I FD 
PNSQAGHQGLVDP IQFARANQAI 0>!ACONLVDPGS SPSQVlCiSAA 
TXVAKHTS ALCNACR I AS S KTANPVAKKHFVQSAKEVANSTANIi 
VKTIKAIJXn)FSEDNRNKCRIJU7^LTEAVENLTAFASNP 
I PAQISSEGSQAQEPIIiVSAKPMl^SSSYtilRTARSIAINPKDP 
PTWSVI^llSHTVSDSIK^IiITSIRDKAPGO^a)YSrDGINRC 
I RDIEaASIJU^VSQSLATRDD IS VEALQEQLTSVVQE IGHLIDP 
IATAARGEAAQLGHKGTQIiASYFEPL IIAAVGVASKILDHQQQM 
TVC/DOTKTLAKSALOMLYAAKEGGGN P KAQHTHD AI TEAAQLMK 
EAWDIMVTLNEAASEVGLVGGMVI^IAEAMSKIJDEGTPPEP^ 
TFVDYQTTWKySKAlAVTAQEMMTKSVTNPEB 
YGIILAFOGQMAAATAEPEBIGFQIRTRVQDLGHGCI FLVQKAG\ 
ALQVCPTDSYTKREL1 KCAKAVT EKVS LVI^ALQAGNKGTQACI 
TAMAVSGI I ADLDTT I MFATAGTLNAENSETFADHRENTLJCTA 
KALVEDT HXI,VS(MST PDKIAOAA^^/iATI TQI JVEVVKLGAA 
SLGSDDPBTQWLINAI KDVAKALSDCLIS ATKQAAS KPVDDPSM 
YQLKGAAKV^5VTNWSLIiKTVKAVEDEATRGTRAIiBATIBCI KQ 

BIjTVFQSKDVPBICTSS pbbs irmtkgitmatakavaagnscrqb 
dviaiani^rxavsdmltackqas fhpdvsdevrtralrfgtec 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D=Aspartic Acid, E=* 
Glutamic Acid, F=.Phenyl alanine, G=Glycine, 
H^Histidine, I»Isoleucine, K=Lysine, 
L=Leucine, M»Methionine, N=Asparagine, 
P»Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W«Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\ -possible nucleotide insertion) 








TLGYLDLLEHVLVI LQKPTPELKQQLAAFS KRVAGAVTELIQAA 
EAMKGTE WVD PED PTVI AETEIiIiQAAAS IEAAAKKLEQLKPRAK 
P KQADETLDFEEQILEAAKS XAAATS ALVKSASAAQK£LVAQGK 
VGS I PANAADDGQWS QGL I SAARMVAAATSSLCEAANAS VQGHA 
SEEKLISSAlCQVAASTAQLLVACKVKADQDSEAMPJtlJQAAGNAV 
KRASDNIiVRAAQKAAFGKADDDDVVVKTKFVGGIAQ I IAAQEBM 
LKXKREI^EARKKLAQIRQQQYKPLPTELREDEG 


6030 


3 


1777 


FPGttJC^PAI^LEVLICLGLMGI»ERALNVIA 

TSRRVELLI FSHIjHBLS LR WHIiGRRTGEVLR IADRGTSSVTGLL 
SYLVFWIPTLADIIIGIIYFSMFFNAWFGLlVKt.TNSI.YLTLT 
IVVTEmTKFRRAMNTQENATRARAVDSLU^ 
VERYREAI I KYQGLEWKS S ASLVtiLWQfTQNLVI GLGI/LAGSLLC 
AYFVTEQKLQ VGDYVT.FGT YI I QLYM PLNWFGTYYRM 1QTNF1 D 
MENMFDIiLKK\3TEVKDI»PGAGP FRFQKGR IE FENVHFS YADGR 
BTI^DVSPTVMPGQTLALVGPSGAGKSTIIJU>LFRFYDISSCX:I 
RIDGQDISQVTQALFRFSHWBiCPKDTVLFNDTIADNIRYGRVT 
AGND B VKAAAQAAG I HDAI MAF PFX; YRTQVGKRGIiKT^GGBKQR 
VAI AR T I LKAPG I ILLDEATS ALDTS NER A I Q AS I AKVCANRTT 
1 WAHRIiSTWNADQILVI KDGCI VERGRHEALLSRGGVYADMW 
QLQQGQKETS EDTKPQTMER 


6031 


160 


16^94 


LHMSKNLJ3KSNVNEAGKSKSNDSEEGIjBI)AVEGADRALQ 

DSS S PQRVQR PHSSPPRFVTVEBIXKTARGVTtTMALAHE I WNG 

DFQIKPWLPENSLKKRVKEIVHKAFWIXn>SVQLSBDpPAYDHA 

IKLVGEIKBTI^SFLLPGHTRliPJJQITF^DLDLIKQBAENGAL 

D1S KLAEFI IGMMGTLCAPARDBB VKKLKD I KE XVPLFRE I KS V 

LDLM KVDMANFAISS IRPHLMCX3S VE YERKKFQE I LERQPNS LD 

fvtqwleeasedl^tqkykhalipvggmaagsgdmprlspvavqn 
yaylkllkwdhlqrpfpetvlmdqsrfhblqlqXreqltilgav 

LLVTFSMAAPGISSQADFAEKLKMIVKILLTDMHLPSFHLKDVL 
TTIGEKVCXBVSSCI^LCGSSPFTTDKBTVLKGQIQAVASPDDP 
IRRIMBSRILTFIjEm^GHQKI>LPTVPGGLSPVQREl,BEVAI 
KFARLVNYNKMVFCPYYDAILSKILVRS 


£632 


39 


2415 

• 


AARLCRAQPTKSAWMIRDIiSKMYPQTRIIPAPHUFAUPFKFTISE 
SCDR I KEEFQFLQAQYHS LKIjECEKLASE KTEMQRHYVMYYEMS 

ygijfl emhkqae i vkrlnai caqvl p fls qkhqq0 wqavkrak 
qvtmafj^aiigo^IjOaqhi>shghgi^vpltphpsglqppaip 
pigss agllalssalggqshlp i kdbkkittldndhqrdrds i kss 

SVS PSAS FRGAEKiniNSAD YS SESKKQKTBEKE lAARYDSDGEK 
SDDNLVVDVSNEDPSSPRGSPAHSPRBNGl£)KTRIiKKDAPISP 
ASIASSSSTPSSKSKEr^LNEKSTTPVSKS^^rPTPRTI>APTPGS 
NSTPGLRPVPGKRPGVDPIJVSSI^TPMAVPCPYPTPFGIVPHAG 
MNGELTS PGAAYAGLHMISPOMSAAAAAAAAAAAYGRSPVVGFI) 
PKlffiMRVPAipPNLTGIPGGKPAYSPHVSADGQMQPVPFPPDAL 
IGPGIPRHARQINTLKfHGEWCAVTISNP 

DISIIPGNKSPVSQ^CLNRDNYIRSCRLLPDGRTLIVGGEASTL 
S IWDLAAPTPRI KABLTSS APACYAIAI SPDSKVOTSCCSDGNI 
AVWDLJRJQTLVRQFQGirTIXiASClDISl^ 
W\DIJIEGRQIJQQHD/FFTSPVFSI^CP\TBEWIAVGMENSN\V 
EVLHVTKPDKYQLH7^SSCVI>SI,KFAHrc^ 
W\RTPYG\ASIF\QSKBSSS\VLSCDI \SVDDKYIVTGS\GDK\ 

KATVXJSVIX 


6033 


39 


241S 


AARLCRAQPTKSAWMtRDI^KMYPQl«RHPAPHQPAQPFKFTlSE 
SCDRIKEEFQFLQAQYlISLKLECEKIiASEKTEMQRHTVWYYFJ^ 
YGLUTI EMHKQAEI VB3iLNAl CAQ V I PFLSQF^CXXJVVQAVERAK 
QVTMABLKAI IGOQQLQAQHLSHGHGLPVPLTPHPSGLQPPAIP 
P I GSSAGLLALSSALGGQS HLPI KDBKKHHDNDHQRDRDS I KS S 
S VS PSAS FRGAEKHRNSADYS S E S KKQKTEEK3I AARYDS DGBK 
SDDNLWDVSNEDPSS PRGSPAHSPREMGliDKTRLLKRDAPIS P 
ASIASSSSTPSSKSJCELSLNEKSTTPVSKSNTPTPRTnAPTPGS 
NSTPGIJIPVPGKPPGVDPLASSLRTPMAVPCPYPTPFGrVPHAa 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide } 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(Alanine, C=Cysteine, D«Aspartic Acid, E- 
Glutamic Acid r F-Phenylalanine, G«Glycine, 
H^Histidinc, I=Isoleucine, Kafcysine, 
L^Leucine, ^Methionine, N^Asparagine, 
P^Proline, Q^Glutamine , R^Arginine, 
S=Serine, T-Threonine , V=Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, *^Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








MNGBLTS PGAAYAGLHNI S PQM S AAAAAAAAAAAYGRS PWGFD 
PHmimVPAIPPNLTQIPGGKPAYSFHVSADSQMQPVpFPPDAL 
IGPGI PRHARQ INTL^GEVVCAVTISNPTRHVYTGGKGCVKVW 
DISHPGNKSPVSQbDCLNRDNY IRSCRLLPDGRTIj IVGGEASTL 
SIV?DIiAAPrPRIKAELTSSAPACYALAISPDSKVCPSCCSDGNI 
AVWDLHNQTLVRQFWm)GASCIDISNDGTKLWTGGI^ 
W\DLRBGRQLQQHD/FFTSPVFSLGYCP\TBEWLAVGMENSN\V 
EVlaiVTKPDKYQiaiJIBSCVLSLKF^ 

W\RTPYG\ASIP\Q3K3SSS\VLSC33I\SVDDKYIVTGS\GI)K\ 
RATVYBVIY 


6034 


2683 


714 


ESGRRRKIiKRRRSPCPGTAGGPGETNPGPGACPHGPREEAAAAM 
EIAPQEAPPVPGADGDrEBAPAKAGSPSPASPPADGRLKAAAKR 
VTFPSDPJDIVSGAVE PKD PWRHAQNVTVDRVTGAYKQACQKIiNC 
RQI PKLLRQLQEFTDLGHRLDCIiDLKGE KLD YKTCBAIiKEVFKR 
I^FKWDI^QTNLDEDGASALFDMIBYYBSATHLNISFNKHIGT 
RGWQAAAHMMRiCTSCLQYI,\DAROT 
IAVIjHLBNaSIi3GIWI*!LI»ATAIJCMNK^ 
DSAQLGNIjLKFNCSLQI l»DLRNNHVLD SGIiAY icbglkbqrkgi, 
VTL\VLWNNQLTHTGMAFXXjOT^ 

RHLKNGI>ISNRSVLRLGLASTKIjTCRGAVAVAEFIAES prixrl 

DI»RKNBIKTGGIiMALSIiAI»KVMHSLLR1J>I^ 

TQKALIJ^IQNGCKRNLVIJ^EREBKBQPPQI*SASMPF/rTATBP 

qpddepaagvqngapspapspdsdsdsdsdgeebfjsbegerdbt 
psgaidtrdtgssepqppprpprsgpplpnglkpefalalppbp 
ppgpbvkggscglehelscsknekeleellleasqbsgqett, 


6035 


19 


404 


SVTYLGIILHKNTGALPADPVQLISQrPTP£ri'K^^ 
YFYLHIPGFAILTKPIXTKLTKBNI^AIDPKSFSHSSFRSLKI^ 
LENAffTLAlJ?DSSQPF\SIiBTARVQGCVVE I LTQOLG PLP V 


6036 


1745 


356 


LPDVEKLGRRRGRKMDSVEKGAATSVSNPRGRPSRGRPPKtiQRN 
SRGGQGRG VEKPPHIiAALI LARGGS KGI PLKNI KHLAGVPLIGW 
VLRAALDSGAFQSVWVSTDHDE I ENVAKQFGAQVHRRSS £ VSKD 
SSTSLDAIIEFLNYHNF^IVGNIGATSPCLHPTDLQHVAEMIR 
EEGYDSVFSVVRRHQFRWSBIQKGVRBVTEPLNLNPAKRPRRQD 
WDGELYEMGSFYFAKRHLIEMGTYLCX^KMAYYEMRAEHSVDI^ 
DI D W P IAEQRVLR YGYFGKBKLKB I KLLVCN IDGCLTNGHI YVS 
GDQKBIISYDViamiGISIiLiaCSGIEVRIilSBRACSKQTr^SLK 
U>CKMFA^SDKIAVVDBWRKEMGLCWKBVAYIX3NEVSDEBCn J K 
RVGLSGAPADACSTAQKAVG Y I CKCNGGRGA\ IRBFAEH I C\ LL 
MEKGLINFM PKNRNLAVN I GEKK 


6037 


2936 


1919 


WTSWWMSSVLTILLFSLQGNKMIjNYSAPSAGGYIiLPRKPVGTPA " 
GGGFPRBHSVTLPSSKFRQNQLLSSLKGKPAPALSSRDSRFRDR 
SFSBGGERliLP-l^KQPGGGQVNSSRYKTXELCRPFEBNGACKYG 
DKCQFAHGIHBI^LTRHPKYXTEI^TFHTIGFCPyGPRCHFI 

hnaberralagardlsadrprlqhsfs fagfps aaataaatgll 
dsptsitpppii^addllgsptlpdgtnnpf\afssqklaslfa 

PSMGLPGGG3PTTFLFRPMSBSPHMFDSPPSPQDSr*SDQEGYT*S 
SSSSSHSGSDS PTLDNSRRLP I FSRLS I SDD 


603B 


1450 


426 


SSALQEFGTRNHTFGVPLPHR RKQ I ISCNICXJLRFNSDSQAAAH" 

YKGT KHAKKL KALFJ^MKl^QKS VTAJO)SAKTTFTS I TTNT INTS 

SDKTDGTAGTPAISTTTTVBI R5CSSVMTTEXTS KVEKSPTTATG 

MS5^PSTETKERyAKT?rjAVf*QT.r'inyavMfitt<^T.Tg&tr^ror^iriJV > 
noov. r o i a l aaDAAjutulj \ 1 wijV#IVi V AVnon^uliAluloV} 1 Jvtlxv 

TMLEARNGSGTI KAFPRAGVKGKGPVNKGNTGLQNKTFHCE ICD 
VHVNSBTQLKQHISSRRHKDRAAGKPPKPKYSPYNKLQKTAHPL 
GVKLVFS KEPSKPLAPRILPNPLAAAAAAAAVAVSS PPSLRTAP 
AATLFQTSAIiPPALLKPAPGP IRTAHTP VLFAPY 


6039 


4073 


1000 


IOEYKARLTIAKLDDFEEI3NEDDDFJ*RWQRBKAAiQTEIjINKL 
NFLDEABKBLATVNSNPFDDPDAAELNPFGDPDSEEPITETASP 
RKTFJ)SFYNNSYNPFKEVQrrPQYIiNPFDEPEAFVTIKDSPPQST 
KKKNIRPVaMSKTLYADSSKTF^EBXDESNPFYBPKSTPPPXiNL 
VNPVQBIiBTBRRVKRKAPAPPVLS PKTGVLNBNTVSAGKDLSTS 
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SEQ 
ID 
HO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Pieoxccea ena 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine f C«=Cyoteine f D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=I»ysine, 
L=Leucine, M-Methionine , N=^Asparagine, 
P=Proline, Q=Glut amine, RsArginine, 
S-Serine, T-Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosina, X- Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PKPS PI PSPVLGRKPNASQSLLVHO^EVTKNxT^KIlNFTTSW 
RNGIjSFCAI LHHFRPDLI DYKS LNPQD I KENNK KA YDGFAS I GI 
SRLjjEPSPMVLIiAI PDKLTVMTYLYQIRaHFSGQEI»NWQIEEN 
SSXSTYKVGNYETDTNSSVDQEKPYAEI^DLKREPELQQPISGA 
VOFLSQDDS VFVNDSGVGESESEHQTPDDHLSPS'lAS PYCRRTK 
SDTEPQKSQQSSGRTSGSDDPGICS^TIT3STQAQVlJt/3KKRIiLKA 
BTIiEI»SDLYVSDKKKDMS PPPI CEETDEQKLQTUDIGSNLEKEK 
LENS RSLE CRSDPES P I KXTSLS PTSKIiGYSYS RDLDLiAKKKHA 
SLRQTKSDPDADEnTIiNHADHSSKIVQHRLLSRQKELKERARVL 
riEQARRDAALKAGNKHNTNTAAPFCNROT-SDQQDEERRRQLRER 
ARQLlAEARSGGKMSBLPSYGERAAEKLKERSKASGDENMsTEEl 
m'NEKIPEGFVVGGGDELTNLKNDLDTPKQNSKLVDI.KLKKLLB 
VQPOVANSPSSAAQKAVTESSEQDMKSOTBDI^TERiQKTTERF 
RKPWPSKDSTVRKTQLOS ESQ Y I ENRPEMKRQRS IQEDTKKGN 
EEKAAITETQRKPSEDEVIjbnCGFKDS\SQYVVGEDAAI,ENEQKQ 
IOTttAAbVEKRliRYLMDTGRNTO 

RKNQLStJ^KBHDI^RYEIil^REIJJAMLAl EDWQKTEAQKRRE 
QKLIJDE IjVALVN KRDAL V R DI J3 A0EX QAKKKDKHLERTIJ5QNKG 
KMAKKEEKCVLQ 


6040 


475 


1052 


PTALMTAPSCAF PVQFRQP S VSGLSQ ITKSLY I SNGVAANNKLM 
I^SNQITM\narVSVEVVNTI.YEl)IQYMQVPVAD 
PIADHIHSVEMKG^\TIiLHC^AGVSRSAALCLAYLM^^ 
LDAKTWTKBCRP 1 I RPNSGFWEQL IHYBFQI*FGKNTVHMVSSPV 
GMIPD I YEKEVRLM I PL 


6041 


2 


38BG 

* 


TEKDEKTAHNLEHVIiIHFWERL^ICVAK^ 

VLQKPKGSLKSSKKKNGKVRPADE ILESNKBNEKCVS S EG 
BKLECMEiTTEPSLTHNSSGI^PLR^ 

VNER KS EQHLRFLSTLIjDS ES S SRVFKMLI/3DE KQS IVQARPIJ3 
I AKLVQKNPAVQFL YQ KIj I GWLNEDQRKDFGFIjVDI I>YSALRCC 
DNDMERKXVLDDLTKVBIiKWNSLLKl IEKACPSSDKHALVTPWI* 
KGDILGEKLVNIiAI>CLCN^LESRVSSESHFSERWTr»LSLVLSQ 
HVKlTOYIiIGEVYVERI IVRLHETLFKTKKI^EAESSDSSVS FIC 
DVAYNYFSSAKGCLLMPSSBDI^TLFQLCAQSKEKTHLPDFLI 
CKL KNTWf vSGVNLLVITQTDS S YKEST Fr^HLSTVLWI.KNQVQAS C L 
D I NSIXJVLLSAVDDLLNTLLES EDS YLMGVYIGS VMPND3E WEK 
MRQS^MQWIiHRPLLEGRL^LlTrECFI^^ 

SALLSKMVLTAliRKBTVIiENllBLBKl lAEIJj YSLQWCEEIiDNPP 

IFLIGFCEILQKMNITYDNLRVLGNMSGL^ 

SLI IAKI,II^RSISSDEVKPHYKRKESFFPLTEGNLHTIOSIiCP 

EI^KEEKKE FSAQC I PAIiLGWTKXDLCSTNGGPGHIAI FWSCLQ 

TKS IDDGEULHGir.KI I ISWKK3SHRDIFLFSCNLSEAS PKVLGV 

NIKIIRFI^LPl^CSSPIiABSB1«DFIMCSMI»AWI3TTSEN0AIi 

YSIPLVQLFACVSCDLACDI^AFFDSITIJyTIGNLPVNLlSEWK 

BFPSQ^IHSLIJjPILVTVTGENKDVSETSFQNA^KPMCETLTY 

VQIAVYE^LYTGUflPEIiPQYDQDNLKSYGDKEEEPALSPPAALMS 
IiLS IQEDLLENVLGC I P VGQ I VTI KPIjSEDFCYVLGYIJjTWKLI 
I/TFFKAASSQLRALYSBmiRKTKSUiTKLLYH^ 
TAVEVPNKDPKTFFTBELQLS irettmlpyhi phlacsvyhmtl 

KDLPAMVRiWWNS S EKRVFNI VDRFTSKYVSS VLSFQB 1 3S VQT 
STQLFNGMTVKARATTREVMATYTIKDIVIBL1 IQIiPSNYPLGS 
I r^SGKRVGVAVQX3WRNWMLQLSTYI < THQNGS IMEGLALWKNN 
VDKRFEGVEDCMICFSVIHGFNYSLPKKACRTCKKKFHSA\CLY 
KWFTSSNKSTCSLCRBTFF 


6042 


1306 


253 


MAEIAPASPSD I KASVSNGDTTLLCSRRQSCGMCJE VRQ VSLTYP 
GSPAPSHSLPLQPRSGGSLCPSRAW/PDPHQLFDDTSSAQSRGY 
GAQRAPGGI^ YPAA5MPHAAFT^P VSNMAMAYGS SIiAAQGKB 
LVDKNIDRFXP ITKLKYYFAVDTMYVGRKLGIxLFFPYLHQDWEV 
QYCXJlXrPVAPRFDVmPDLYIPAMAFITYVLVAGLAT/GTQDRFS 
PDLI^LQASSAlA^TLBVI^IiLSI.YLVTVin'DLTTro 
GYKYVGMIGGVLMGI^FGKIGYYliVliJWCCVAI FVFHIRTLRLK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location, 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AtaAlanine , OCysteine, D-Aspartic Acid, E» 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Hi£jtidine, I^Isoleucme, K— Lysine, 
I>=Leucine, M=Methionine, N=Asparagine , 
P«Proline, Q=Glutaraine, R=Arginine , 
SsiSerinei T=Tnreonine, vwvaline, 
W-Tryptophan, Y-Tyrosine, X= Unknown , *«>Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 








ZIJVDAAAEGVPVRGARNQIiRMYLTMAVATvAQraLM 5f WLTFHLVR 


6043 


403 


599 


LCLFFPPPCATPVLPLPSLISAL/CLSHLSVSSWFCPCQPPLPC 
PLPPLQNKTAKGSI^TEQSBRG 


6044 


793 


412 


KLEMWNPTLISKVKISREVTMIASKFGIGQQVRHSLLGyLGVVV 
DIDPVYSLSE PS PPKLAVNDELRAAPW YHVVMEDDNGLPVHTYL 
AEAQIiBSBLQDEHP\BQPSMDBIiAQ!riRKQLQAPRLSN 


6045 


1S5 


2299 


SPLPQVAAMNYromLSDSNFMANLPKGYPrro 

AHSPGATPGPGTATAKHSSGVAPAASPAAPSPGSSGGGGFFSSIi 

SNAVKQTTAAAAATFS EQVGGGSGGAGRGGAASR VLLVT DE PHT 

DWAKYFKGKKIHGB IDI KVHQABPSDLNLVAHANGGPS VE)MBVIi 

RNGVK^HSLKPDFVLIRQllAFSMARNGDYRSLVIGLQYAGIPS 

VNSLHSVYNFCDKPWVFAQMVRiH^ 

BMLSS\TTYPVWW1GHGTLW5WGKVKV^^ 

FCITATAKPFinAKYDVRVQKIGQNYKAYl^TSVSGlWKmroSA 

MLEQ IAMSDRYKliWDTCGEIFCXUJUCAXn^IIGKDGRDHX IE 

VVGSS^LlGDHQDEDKQLIVKLVVNKMAOALPRQRQRnASPGR 

GSHGQTPS PGALPLGRQTSQQPAGPPAQQRPPPQGGPPQPGPGP 

QRT^PPLQQPJ?PPQGQQHI*SGLGPPAGSPLPQRI»PSPTSAPQQP 

ASQAAPPTC^QGRQSRPVAGGPGAP PAARPPA3 PS PQRQAGPPQ 

ATRQTSVSGPAPPKA3GAPPGGQQRQGPPQKPPGPAGPTRQASQ 

AGPVPR0X3PPTTQQPRPSGPGPAGRPKPQLAQKPSQDVPPPATA 

AAGGPPHPQI«NKSQSLTNAFNIiPEPAPPRPSl>SQDEVKAKTIRS 

LRKSFASLPSD 


6046 


212 


1075 


EGLTGPCERVPPIiLGRGPPHGATRAGHRRAVRWAGPBSLPPLPR 
SLIMDS PRAGTHQGPLnAETEVGADRCTSTAYQEQR PQVEQVGK 
QAPLS ?GLPAMGGPGPGP CEDPAGAGGAGAGGS EPLVTVTVQCA 
FTVALRARRGADLSSI.RAL1^AI»PH0\A0U5QI>SYLAPGEDGH 
WVP I PE EES IiQRAWQDAAACPRGLQLQCRGAGGRPVL YQVVAQH 
SYSAC<3PEDLGFRQGDTVPVIjCEVDQA^ 
WPAGPRMSGAPGRIiPRSQQGDQP 


6047 


49 


1405 


PVItVTS LRMREADTLRPPQLMEVSADI ISTVBPNHTGELIATGD 

KGGRWIFQREPBSKNAPHSQGBYDVYSTFQSHEPEFDYLKSLE 

IEEKINKI KWLPQOjMAAHS U^STNDKT I KLWK I TERDKRPEGYN 

UO>BBGKIJ©I^TVTSLQVPVLKPMDLMVEVSPRRIF^ 

IN£ISVNSrK:ETYMSADDIiRINLWHI*AITDRSFTP\NIVr)XKPA 

NMEDbTEVTTASEPHPiniCNLFVYSSSKGSLRLCX)MRAAAIiCDK 

HSKLFEEPEDPSNRSFFSEriS\SVSDVKPSHSDRYMLTR\DYL 

TVKVTOL\NMBARPircrYaVHDx^ 

WNGS DS V I MTGA\ YTWPFPJ<FDRNTKRPVTL\RASRESS KPRAV 
LKPRRVCVGGKRRRDD I S VDS LDFTKK J LHTAWHPAEN I IA IAA 
TNNLYI FQDKVNSDMH 


6048 


1 


3194 


GIRTPKFCE)SPTSDLEMRNGRGRi3KRMRPNSEJTPV^ 
KGTSNSSKTRAGANSKGRRGSQI^ETOPPASSTSBDVKASPSS 
ANKRKNKPLSDMELNSSSEBSKGSKRVRTNSI^SATGPLPGTKV 
KPTVIjDRNCPS PVL IDCPH PNC!^^KKYKHINGIlKYHQAK7^HTDDD 
SKPEADGDSEYGEKPILHADLGSCNG\ASVSgK\GSLS PARSAT 
PKVRLVEPHSPSPSSKFSTKGI^XKKI^BGDTO^ 
DGPSVMDETSNDAFDSLRRKCMEKEKCKKP3SLKPEKIPSKSLK 
SARPI /APIAIPPQQIYTFG/rATFTAASPGSSSGLTATVAQAMP 
NSPQLKJ^IQPKPTT^MGEPFTVNPALTPAKDKKIOKDKXKKBSSKE 

IJ^IKAEADKIYSFTDMAPSPSlGGSSPXRNTTPTQPLTPIJnAr 
TQNGARASSVKTNS PAYSDISDAGEDGEGKVD5VKS KDAEQLVK 
EGAKKTLFPPQPQSKDSPYYQGFESYYSPSYAQ6SPGALNPSSQ 
AGVESQALKTKRDEBPBS1EGKVKND1 CEEKKPELSS S SQQPSV 
IQQRF^^MYMQ SLYYNQYAYVPP YGYSDQS YHTHIiL STNTAYRQQ 
YEEOAJKRQSLEC^RGVDKKAF^GIJCERFAALKEKHKQKPS XPP 
TLTKAPSLTDLVKSGPGKAKEPGADPAKSVI IPKLP0SSKLPGQ 
APKGLKVKIjSDASHLSKEAS EAKTGAECGRQABMDP I LWYRQBA 
EPRMWTYVYPAKYSDIKSFX^ERWKEERDRKIiKEERSRSKDSVpK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acj^d 
residue of 
amino acid 
sequence 


Amino acid segment containing Bignal peptide i 
(Ar»Alanine, C=Cysteine, D=Aspartic Acid, E== 
Glutamic Acid, ^Phenylalanine, <3=Glycine/ 
H=Hi s t idine , I=Isoleucine , K-Lysiue , 
L«Leucine, M ^Methionine, N^Asparagine , 
P= Proline, Q^Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
H^Tryptophan, YeTyxosine, X=Unknown, *^Stop 
Codon, /=possible nucleotide deletion, 
V=possible nucleotide insertion) 








HIX3KBSTSSDCKLPTSEE5RLG3KBPRPSVHVPVSSPI,TQHQSY 
IPYMHGY8 YSQS YDPNHPSYRSMPAVMPflONYPGS YhPSSYSFS P 
YG3KVSGGE DABKARASPS VTCKSSSE S KALDI LQQHASH YKS K 
SPTISDKTSQERDRGGCGWGGGGSCSSVGGASGGKRSVDRPRT 
SPSQRI^STHHHHHHIjGYSLLPAQYNLPYAAGLSSTAIVASQQG 
STPSLYPPPRR 


6049 


215 


1089 


AMTGVFDRRVPS IRSGDFQAPFQTSAAMHHPSQESPTIiPESSAT 

dsz>yys ptggaphgycsptsas yg\kaiinp yqyqyhgvngsags 
ypakayadysyassyhqyggaynrvpsatnqpekevtepevrmv 
i^kpkkvrkprtiyssfqiaaix)RRPuktqylalperaelaasl 
gltqtqvicrwfqnkrs ki kkimkngem ppehs psssdpmacnsp 
qspavwbpqgssrslshhpiiahpptsnqspassylbnsaswyts 
aassinshlpppgslqhplalasgtly 


6050 


566 


1718 


kglkrtccambbsdsbkttbkenlgprmdpplgepg\gslgwvi. " 
pntamkkkvllmgksgsgktsmrs1 1 fanyiardtrrlgatild 

RIHSt^IHSSLSTYSliVDSVGirrKTFIIVBHSHVR^ 
DCGGQDTFMKNYFTSQRDWI FRNVB VLI YVFDVESRBIdBKDMHY 
YQSCLEAILQNSPDAK I FCLVH KMDLVQEDQRPLIPKRREEDLR 
RI^RP^CSCPRTSIWDETLYKAWSSr^TYQLIPWQQrJBMNIJW 
FAE 1 1 EADEVLLFERATFLV I SHYQCKBQRDAHRFEKI SNI I KQ 
FKLSCSKIAASFQSME VRNSNFAAF I DI PTSNTYVMWMSDPfl I 
PSAATIONIRNARKHFEKLERVDGPKQCLLMR 


6051 


566 


171B 


KGLERTCCAMEESDSKKTTEKENIiG?R^PPLGEPG\GSIiGW^ 
PNTAMKKKVLLMGKSGSGKTSMRSI I FANYI ARDTRRLGAT ILD 
RIHSIiQINSSI^rYSLVDSV^TKTFDVBHSHWPIiG^VLNLW 
rXXJGQDTFMENYPTSQRDNIFW^EVLIYVFDVESRELEKDMHY 
YOSOiEAILQNSPDAKlFC^VHKMDLVQEDQRDLIFKRREEDLR 
RLSRPI*BCSCFRTSrWDKriiYKAWSS 1VYQLI PNVQQLEMNLRK 
FAEI IFADEVI,T.PERATPLVISHYQCKJEORDAHRFEKISNI IKQ 
FKLSCSKJAASFQSMEVRMSNFAAFIDXFTSNTYVMVVMSDPSI 
PSAATI*INIRNARKHFBKLERVIX3PKQCIjI«MR 


6052 


566 


1718 


KGLERTCCAMEESDSEKTTEKE^I.GPRMDPPLGEPaVGSLGWVL 
PNTAMKKKVLI«MGKSGSGKTSMRS 1 1 FANYIARDTRRLGATILD 
RIHSItflNSSI^TYSLVI^VGOTKTFWEH 
DCGGQDTFMENYFTSQRDN I FRNVEVLI YVFDVESRBLEKDMHY 
YQSCLEAlLQMS PDAKI FCLVHXMDLVQEDQRDLIFKBREEDIjR 
RLSRPLECSCFRTSIWDETLYRAW3S I VYQLIPHVQQItEMNliRN 
FABI IEADBVI^FERATFIiVXSHYQCKEQRDAHRFEKISNI IKQ 
FKI^CSKIiAASFQSMEVRNSNF7AAFlDIFTSNTYVMVVMSDPSI 
PSAATLINIRIUlRlCHFEKliERTOSPKQCLUfR 


6053 


201 


1704 


KGTEMNKSRWQSRRRHGRRSHQQNPWFRLRDSKDRSDSRAAQPA " 
HDSGHGDDBSPSTSSGTAGT5 S VPELPG FYFDPEKKR YFRIiL PG 
HNNCNPI iTKES I RQKEMES KRIjRIJjQEEDRRKKI ARMGFNAS SM 
LRKSQI^FIiNVTNYCBSiAHEIiRJbS^ 

RFNLILADTNSDRLFTVNDVTVGGSKYGI INLQSLKTPTLKVFM 
HBtoYF1TOKV\NSVCHASIJ9HLDSHI^ 
PASLFWSHPAGIDRP<j\MI,CSFRIPGAWSCaWSLHIQANNCFS 
TGF»SRRVLI.TNWTGHRQS FGTNSDVIiAiQQFALMAPId,F^GCRS 
G2IFAIDLRCGWQGKGWKATRLFHDSAVTSVR1LQDBQYLMASD 
MAGICCKLWDLRTTKCVRQY BGH VNEYAYLPmVHEE EG ILVAVG 
QDCYTR I WSLHDARLLRTI PSP YPASKADI PSVAFS SRLGGS RG 
APQLLMAVGQDLYCYSYS 


6054 


1 


1054 


PPIARIXJEFGTSRRHMAAPSGVHIjLVRRGSHRIFSSPLNHIYIjH 
KOSSSQQRRNFFFRRQRDISHS I VLPAAVSSAHPVPKRTKKPDY 
VTTGIVPDWGDSIBVRTJEDQIC^LHQACQIAnHVIJJAG 
DMTTEEIDALVHRE 1 1 SHNAYPS PLGYGGFPKSVCTSVNNVLCH 
GIPDSRPLQDGDI INI DVTVYYNG YHGGDTSETFLVGNVDEOGKK 
LVBVARRCRDEAJAACRAGAPFSVIGOTISHITHQNGFQVaPHF 
VGHGIGSYFRGHPEIWIIHANDSDLPMEEGMAFTIEPIITEGSPE 
FKVL KDAWTWSLD/TSKVSAQFEHTVL I TS RGAQI LTKLPHEA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, 
Glutairic Acid, F=Phenylalanine f G»Glyeine, 
H-His t idine , I-I soleucine , K=Lysine , 
L= Leucine, M=Methionine , N=Asparagine, 
P^Proline, Q=Glutaraine, R^Rrginine, 
S-Serine, T^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=*possible nucleotide deletion, 
\=possible nucleotide insertion) 


■ 6055 


421 


2364 


PPYFLLSPIAWWLYGQSDRTETD X SQSAGPPPGTIiQCaAtiHHDP 

GCANCSRPCRD^PPACQCHTIWFPGNAI^NGVQPPBLSRTI^ALI 

SSREPPRKKWCSQTETGKERBRTSPLTQGGKRP1CLQHGLAGICM 

TLLITGDS I VS AEAVWDHVTMANRELAPKAGDVIKVLDASNKDW 

WWGQIDDEEGN FPAS F VRI>WVNHEDEVEEGPS D VQNGHLDPNSD 

CLCLGRPLQNRDQMRANVINBIMSTERHyiiQlira 

RKPJU)MFSDEQLKVIPG1«FJ}IYRFQMGFVRD 

SEIGPCFI^HQDGFWIYSEYCNNHIJ^CMBI^KLMKDSRYQHFF 

FACRLUXWIDI A\I DGFI^TPVQKICKYPLQIiAFJaLKYTAQDH 

SDYRWAAALAVMRNVTO^INERKRRIjENIDKIAQWQASV^ 

GBDILDRS$ELIYTGRMAWIYQP\YGRNQQRVFFLFDHQMVTjCK 

KDIjIRPJ)ILYYXGRIDMDKYEVVDIEDGRDDDFNVSMKNAFKLII 

NKETBBIHLFFAKKLBEKIRWLRAFRBERKMVQBDEKIGFEISE 

NQKRQAAMTVRKVPKQKGVNSARS VP PSYPPPQDPLKHGQ YLVP 

\DGIAQ3QVFEFTEPKRSQSPFWQNFSRLTPFKK 


6056 


43 


"3358 


SGGRGPVRVRSEQLSPSAEQVSQISQISIiGRRPbSSLPPPPSRA 

LAPTRAPDTALTIMEVAWESFO^SCKIhrrFTlPSMEEFREFNK 

YLA YMESKGAHRAGIAKVI P PKEW KPRQCYDD I DIUjLI PAPIQQ 

MVTGQSGLFTQYN I QKKA^r^VKRFRQLANSG^CYC^^PRYIlD YEDL 

BRKYWKNKTFVAP I YGADINGS IYDKGVDBWNIARLNTVLDWE 

BBCXJIS IBGVNTPYLYPGMWKTTFAMHICTMDLYSINYLHFGBP 

KSWYAI?PFJIGKRIJERIAQGFF^SSSQGX3)AFL1OT 

LKKYGI PFTIKITQEAGEFMITFPYGYHAGPKHGFNCSVES'TOFAT 

VRWI D YGKVAKLCTTCRKDMVKISMD I FVRKFQPDRYQLWKQG KD 

IYTIDHTKPTPASTPEVKAWLQRPJIKVRKASRSFXJCARSTSKRP 

KADKEEKV^DRVDGAPATPNPDSVTDDIiKVS 

SSEEE SSA3RMQVE^1»SDHI KLSGNSCXiSTS VTED IICrEDDKA 

YAYRSVPSISSEADDSXPLSTGYEKPEKSDPSKLSWPKS PESCS 

SVAESNGVLTEGEESDVESHGNGLBPGEIPAVPSGERNSFKVPS 

I AEGENKTSKSWRHPLSRPPARS PMTLVTOQQAPSDEELPEVLS I 

EEF^F^ESWAKPliIHLWQTiO?PNFAABQEYNATVARMKPHCAI 

CTLLMPYHKPDSSNEENDARWBTKIjDEVVTSEGKTKPIj^ pemcf 

IYSBF^F^SPPNAFIiF^IXSTSI^ISCAKCaWVHAfiCYGIPSH 

BICTXSWLCARCKiUiAWTABCa^CNI^GGALKQTKN^ 

vavpevrftnvpbrtqidvgri plqruclkci FCRHRVKRVSGA 
ciqcsygrcpasfhvtcahaagvtAmepddwpyvvnitcfpjikv 

NPlWKSKA<3KVISVGtfrVITKHRiaTRYYS 

MFDDGS FSRDTFPEDIVSRDCI>KI^PPAEGEWQVXHPDGKLYG 

AKYFGSNlAlIMYQ\nEFEDGSQIAMKRBDIYTU)EBLPKRVKARF 

VSAJ^CHLGTCX3^SI^SPHVS0ACXJETYLGFW F 

LSGTY 


6057^ 


1 


B53 


FVARliKBQEGEGGLGPRKBKGRARGnRERRRKMQIVrRCCFVFLVQ 
GSLYLVICGQDDGPPGSEDPBRDDHEGQPRPRVPRKRGHISPKS 
RPHANSTLIXJIjLAPPGFJ\WGIIjGQPPNRPNHSPPPSAKVKKIFG 
WGDFYSNI KTVALNIiVTGKIVDHG^TFSVHFQHMATGQGNlS 
ISIiVPPSKAVEFHQEQQIFIKAKASKIFNC\RMBWBKVE\RGRR 
TSLFTHDPAKICSRDHAQSSATWSC^PFKVVCVYIAFYSTDYR 
LVQKVCPDYNYHSDTPYYPSG 


" 6058 


1 


986 


HPLPSASLGLPSVSLGVSIjCVRSAIiljFAWPhu^iPKRJlRARVGS P 

SGDAASSTPPSTRFPGVAIYLVKPRT^SRRAFLTGLARSKGFR 

VLDACSS EATHWM EF/PS AEBAVS^FJtRMAAAPPGCTPPAlaLD 

1 SHUXBSj LGAGQPvPVECKmUjEVAGPoJ^i/Jjb 

PTPLTHHNTGI^EALEIIAEAAGFEGSBGRIiTFCRAASVLKAL 

PSPVTTLSQLQCLPHFGEHSSRVVQELIiBHGVre 

RLFTQI FGVQ VKTADRVfYREGLRTLDDLREQPQKXTQQQKAGB P 

SREAGPWASXiNCTLDPSASTP 


6059 


2 


3650 


QQDFSSLADLTDHRAHRCPGJDGDDDPQI*SWVASSPSSKI)VASPT 
QMIGIXXIDI^I/SEBSGGTGI^PCQFCDKSFIRLSYJj^TIBQJH 
SDKLPFKCTY CSRLFK^KRSRDPJl I KI^TGDKXYHCHB CEAAFS 
RSDHI»KIHWCTRSSSKPFKCrVCKR^ 
EHIJUCSEKFJVKKDDFHCTJYCEDTFSQTO 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=JManine, O Cysteine, D=Aspartic Acid, B« 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, lalsoleucine, K=Lysine, 
LnLeucine, M=Methionine, N-Aspacagine, 
P=Proline, Q~Glutamine , R^Arginine, 
S=Ser ine , T=Threonine , V= Val ine , 
W=Tryptophan, Y=Tyrosine, X^Unknovm, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








ADL^IHCPEVFVDENTIiLAHIHOiUIANQKHXCPMCPE \QFSSV 
\KGVyCHLDSHRQPDSSNHSVSPDPVLGSYASMSSATPDSSASV 
KRGSTPDSTLKPLRGQKKMRDIXjQGVrrKVVYSCPYCSKRDFNSIi 
AVLErHTiKTIHADKPQQSHTCQICI*D5^ 

HAYPVMQPGNISAFHCNYCPEMFADINSLQEHIRVSHCGPNANP 
SDGNNAFFCNQCSMGFLTCSSLTEHIQ\Q\AHCSVGSAKIiESPV 
VQPTQSFMEVYSCPYCTNSPIFGS ILKLTKHIKKNHKNIPLAHS 
KKSKAEQSP VSSDVEVSSPKRQRI»SASANS I SNGEYPCNQCDLK 
FSKFF?yQTF"'iyTrm f KT f TrT^^0AP^gKgpyDSQBSIJOHLTVH 


YMTTSTHYVCESCDKQFSSVDD\l^KH\lJJ)MPHPLCXrraOT 

CQEVFDS\KVS I \ QVHLAVKHSNB KKMYRCTACNVJDFRKEADIjQ 

VRVKHSHI^NPAKAHKCIFOGrTrFSTEVEL^ 

FCSKAFHAI ILLE KHLRKKHCVFDAATKNGTANGVPPMATKKAfi 

PADMX5>n^LKNPBMNSHEASEDDVDASKPMYGCDia5AAYTME 

Vlal^NHRUlDHNrRPGEDDGSRKIOVBFIKGSHKCNVCSRTFFSE 

NGI^EHLQTHRGPAKHYMCPICGRRFPSLLTLTFJnCVTH^KSLD 

TGTCR I CKMPLQSKEE F I KHC3QMH PDLRNS LTGFRCWCMQTVT 

STLELKXHGTFHMQKLAGSSAASS PNGQGLQKIjYKCAIiCZIjKEFR 

SKQDLVKLDVNGI^YGI«a^MARSAN^^ 

GLRCPEC3VKFBSAEDLBSHMQVDHRDLTPETSGPRKGTQTSPV 


PRKKTYQCIKOJMTFENBREIQIHVANHMIBEGINHEC^ 
FDSPAKIjIiCHLIEHSFBGMGGTFKCPVCFT^ fa 
VHGQEDKIYDCSQCPQKPFFQTKLQNHTMSQRAQ 


6060 


2145 


202 


SYE IVGKNKtEVNHSQLKAIXZKCSLPSRIJ^LGENLPIiLDRGFR 
KEPRSRGSRBRDNMLHLRllSCI^FRSWLPAMIAVLIiSIiAPSASS 
DI SAS RPNILLLMADDLG IGDIGCYGNNTMRTPNI DRLAEDGVK 
LTQHISAASIiCTPSRAAFLTGRYPVRSGMVSS IGYRVLQWTGAS 
GGLPTNBTTFAKI LEBKGYATGLIGKWHLGLNCESASDHCHHPL 
HHGFDHFYGMP FSLMGDCARWBI£BKRVNIjEQKI»NFIjFQVIiALV 
ALTLYAGKIjTHL I P VSViM P VI WS ALSAVLLLAS S YFVGAL IVHA 
IXIFLMRlfflTITKQPMCFQRTTPI^LQEVASFLKRNKHGPFIiLFV 
SFLHVHI PLITHENFIjGKSLHGLYGDWKEMDWMVGRII^ 
EGLSNSTL1 YFTSDHGGSLENQEjGKTQYGGWNG I YKGGKGMGGH 
KGGIRVT>GIFRWPGVLPAGRVTGEOTSIiMWFPTVVRI^ 
QDRVIDGQDLIiPIiI»ljGTAQHSDHBy^ 

GlM«KVHFVTPVFQ?EGAGACYGRKVCPCreBKVVHHI)PPI^ 
LSRDPSEXmi>TPASBFWYQVMER\VQQAVWBHQRTIiSPVPLQ 
LDRLGNT WRPWLQPCaJPFPJbCWCLREDDPQ 


6061 


110 


1330 


MNIHMKRKTIKK hi I'FKWRhU^LJJGMPAVRVKTELLESEQGS PN 
VHNYPDMFAVPLLLNKWKGEPPEI^ 

ARTSPTAVSSSPVSMTASASSPSSTSTSS3SSSRLASSPTVITS 

VSSASSSSTVLTPGPLVASASGVGGQQFIiHIIHPVPPSSPMNLQ 

SNKI^HVHRIPVVVQSVPVVYTAVTiSPGNVN^ 

HGKAQMDPRGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALS IA 

RAVQEVHPS PVSRVRGNRMNNQKFPCS I S PFSIESTRRQRTVLN 

PPDSRKTAYSTDC^F\EGI>QQKLYTK55SPGRVHRRTHTGEKPY 

KCTWEGCTWKFARSDELTRHYRKHTi^ 

TUT TTT»T1T»T1MT TT 

IAI*HKRHHWLiv 


eo<;2 


71 


1079 


ETl^KNGPENCKDCHILSBVEAFKS^ 
LIVLFWGSKHFWPEVPKKAYDMEHTFYSNGEKK1Q 
TEIFRSGNGTDETLEVHDFKNGYTGIYFVGIiQKCFIKTQT KVI P 
EFSKPEEEIDE^EITTTFFBQSVIWVPAEKPIENRDFLKNSKI 
LE ICDNVTMYW\ IN PTL\ ISGTFAKQUfflNFAFI ILVSELQDFE 
EFJGEDLHFPANBKKGI EQNEQriWPQVKVEKTRHARQASEEELP 
INDYTENG I E FDPMLDERGY CC I YCRRGNR YCRRVCE PLLG YYP 
YPYCYQGGRVICRVIMPCNVJWVARMLGRV 


6063 


71 


1079 


ETMAKNGPENCBDCHIIjNAEAFKSKKICKSIJ^ 


L IVLFWGSKHFWPEVP KXAYDMEHTFYS NGE KKKI YME IDPVTR 

teifrsgngtdetlbvhdfkngytciyfvglqkcfiktqikvip 

EFSEPEEEIDF^EEITTTFFEQSVIWVPARKPIENRDFLKNSKI 
LEICDNVT^fYW\I^^PTI»\ISGTFAKQIHHNFAFIILVSEiQDFE 
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SEQ 
ID 
NO: 


Predicted / 
beginning 
nucleotide ' - 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C-Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, F=*Phenylalanine, G^Glycine, 
H=Histidine, I-Isoleucine, K= Lysine, 
L«Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q-Glutaraine, R=Arginine , 
S= Serine, T=Threonine, V= Valine, 
W=Tryptopnan, Y=Tyrosine, X-Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








EEGEDLHPPANEKKGIEQNBQWVVPOyKVEKTRHARQASEBBLP 
INDYTENG IEFDPMLDERG YCCI YCRRGNR YCRRVCK PLLGYYP 
YPYCYQGGRVICRVIMPCNWWVARMLGJRV 


6064 


913 


311 


NLPQSL PRPTHHS P P YS LE KMTDLVAVWDVALSDGVHKIEFEHG 

TTSGKRWYVDGKEE IRKEVJMPKI.VGKBTFYVGAAKTKATINID 

MSGEAYBYTLEC^KSLKJCYMEDRS^^ 

LEK3AMD VWCNGKXLBTAGB FVDDGTETHFS IGTH\ACYIKAV\ 

SSG\KRKEGIIHTLIVDNREIPKIAS 


6065 


1153 


641 


MS VR VARVAW VRGLGAS YRRG AS S FPVP P PGAQGVAELIiRDATG 
AEEEAPWAATKRRMPGOCSVI^PPGQGSQWGMGRGIJ^NYPRVR 
Rr,YAAARRVLGYDIJ>ELSIJIGPQETLI»TVHC^PAIPVASLAAV 
EKLHHLQPSVI EN CVAAAGFSVGEFAALVFAGAME EAEG 


6066 


1 68 


3470 


VKENMPATRKPMRYGHTEG1ITEVCPDDSGSFIVTCGSDGDVRIW 
EDLDDDDP KP INVGEKAYSCALKSG KLVTAVSNNTIQVHTFPEG 
VPTCILTRFTTl^HVVFNGDGTKIAAGSSD\Fr.VKIVD 
QQKT?RGHDAPW^LSFDPKDI P1ASASCTCSVRVWQISDQTCA 
ISWPIJ.QKCNDVINAKSICRIAWQ^ 
ESWSHQFDLSDI^ISQTI^rVTWSPCG^ 
ETKDCMEllVKHEKGTCAJOGIAWHPTCXSR^ 
a>PSGKTSSSKVSSRVEKDYNDLFDGDE)MSNAGDFLNDNAVEIP 
S PS KG I IM)DEDDEDIjMMA5GRPRQRSHILEDDE1JS VDISMLKT 
GS S IiLKBEERtXSQEGS IHNLPL VTS OR PFYDGPMPTPRQKP FQS 
GSTPLHIiTHRFWVWNS IGI IRCYNDEQDNAIDVEPHDTS IHHAT 
HLSNTIaNTYTIADLSHEAJL^ 

EWI IDLPQNEDIEAICTiGOGWAAAATSAIjLI^LPTIGGVQKEVF 
SLAGPVVSMAGHGEQLFIVYHRGTGFDGIX^C^ 
QILHGDPLPLTPJCSYIiAWIGFSABGTPCYVDSEGIVRMLNRGLG 
J8TWTPI CNTREHCKGKSDHYWWGI HENPQQLRCI PCKGSR FP P 
TLPRPAVAILS FKLP YCQ IATEKGQME EQFWRS VI FHNHLDYLA 
KNGYEYEBSTKNQATKEQQEl*LMKMIALSaCLEREFRC^^ 
MTQNAVNLAIKYASRSRKL I LAQKLSELAVEKAAK LTATQVEEE 
EEEEDFRKKLNAGYSNTATEtf SQPR FRNQ VEEDAEDSGEADDEE 
KPB IHKPGQNS FS K3TNS SD VSAKSGAVTFSSQGRVNP FKVSAS 
SKKPAMSMNSARSTKILDNMGKSSKK^ IKP 
IjIPKPKPXQASAASYTTOKRNSQTNKTBEVKB^ 
CPPGNTENQRPKTGFQMWLEENRSNI LSDNPDPSDEADI IKEGM 
XRPRVLSTBERKVPIANKAKGETASEGTBAKKRKRVVD&SDETBN 
QEBKAKBNLNtiSKKQKPLD FSTNQKLSAFAFKDE 


6067 


856 


321 


XjPWQRIjGVLLSRGKMAVTGWLESI4?TAQKTALLC^ 
PDGKEt4AEEYt3EKTSEIXVRKWRVKSAI^AMGOWQLEVGDPAPL 
GAGl^GPELIKESNANPIFMRKXITKI^FQWRIRNLPYPKDVYSV 
SVDQKERCI I VRTTNKKYYKKFS I PDLDRHQLPLDDALLSFA\T 
PTAP 


6068 


13 


1730 


GSKMADLJ^EEKPAIAPPVFVFQKDKGQKSPAEQKWLSDSGEEP 

RGEAEAPK^GTGHPE S AGEHAIiE PPAPAGASAS TP PP P APEAQL 

PPFPRELAGRSAGGSSPEGGEDSDREDGNYCPPVKRERTSSLTQ 

FPPSDSEERSSGFRLKPPTLIHGQAPSAGLPSQKPKEQORSVLR 

PAVLQAPQPKALSO/IWSSGTNGVSl^ADCTGAVPAASP 

RSPSEAADEVCAIiBEKEPQKlJBSSNASEBEACEKKDPATQO^FV 

FGQNLRDRVKLINBSVDEADMBIIAGHPSADTP^ 

SLKNSTNSADASSNKFVFGQNMSERVLSPPKLN^ 

AAESG5E S S SQEATPEKESLABS AAAYTKATAR KCLLR KVEVI T 

GEEAE5NVIXJMQCMFVFDKTSQSW\/ERGRGIJ,RI^DMASTDDG 

TLQSR LSD AGPRGSLR \ L I LNTKLNAQMQ I DKAS EK\ S I Rl TAM 

DNBDQGVKVFLISASSKDTCC^AALHH^ 

MPAPEPGAAPSNEEDDSDDDDVLAPSGATAAGAGDEGDGOTTGS 

T 


6069 


583 


27 


PTRPGQAGSSSAMAAQRLGKRVI*SKliQ5PSRARGPGGSPGGLQK 
REARVTVKYDRRELQRRLDVEKWIIX3 

IDELI»BIjESEEERSRKIQGIjIjKS CX5KPVEDF I QELLAKLQGT jHR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C= Cysteine, D=Aspartic Acid, E=> 
Glutamic Acid, F=*Phenylalanine, G=.Glycine, 
H-Histidine, I-Isoleucine, K=»I»ysine, 
L»Leucine, M=Methionine, N-Asparagine, 
P=Proline # Q^Glut amine , RaArginine, 
S«Serine, T=Threonine, V- Valine, 
fNTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\-poK oible nucleotide insertion) 








RPALLCVLSCI 


6070 


478 


853 


IRVTVDGBFIiHYI FPLQFbDS PBW/RFTETHRGRHF\QVTJjTAE 
TDCRYVSVmRKKLYIiLFAQHRY I SRLFSVI»IGSDIADKLYAIiND 
RVYIGKRYEYDIRIiPNFYQMSTPE I RRS PLTQHFQNSRRYW 


6071 


2 


1654 


HEARTKGNMAIiARP\VRLFSIjVTRJUULAPRRGLTVRSPDKPLPV 
VRI PVAIjQRQIiEQRQSRRRNLPRPVLVRPGPLLVS ARR PELNQP 
ARLT1^RWFJ^PIASGX5WKSRRARRDHFSIBRAQQBAPAVRKLS 

skgsfadlgawkprvlhalqb\aapevvq\ pttvqsst I PSLLR 

GRHWCAAETCSGKTLSYLLPLI<)RI>T/3VIPSLDSLPIPAPRGL 

VLVPSR3LAQQVRA\mQPIX^I«LIiVRDLKGGHGMRRIRLQLS 

RQPSADVIiVATPGALWKALKSRI.1 SLEQLSFLVLDEADTLLD3S 

FLRLVOTILRKSHIAEGPADLEDPFNPKAOJUVLVGAOT 

tJJSKVASPDAVTTITSSKIJICIMPHVXQTFIJlLKGADKVABLAm 

ILKHRDRAERTGPSGTVLVFCNSfi STVNWLGYII1DDHKIQIII1RL 

QGQMPALMRVGIF^SFQKSSRDILI/rroXASRGI*DSTGVBLW 

YDFPPTLQDYIHRAGRVGRVGSEVPGTVTSFVTHPmJVSIiVQKI 

ELAARRRRSLPGLASSVKKPLPQAT 


6072 


1 


742 


KMERTEMMPT INS QLKF JtS KP FPLVS SSRWLVKRGELTAY VKDT 
VLFSRRTSKQQVYFFLFMDVL1 ITKKKSEESYNVNDYSLEDQLL 
VF>SCI^EI^SSPGKHSST^YSRO^SASHLPTLTVI»SNHANBK 
VEMLLGAETQSBRARW I TA1X5HSSGKPPADRTSLTQVE I VRS FT 
AKQPDELSIiOVADWIjI \yQRVSDGHYEGER\LRDGERGVf FPME 
CAKBITCO^TIDKNVETiMGRLLGLBTNV 


6073 


620 


B60 


PCRRGIrARPLSRRPG/SILVHCAVGVSRSATLVJAYIiMLYHHLT 
I»VEATKKVKDHRGI I PNRGFIJlQIiIiALDRRLRQG]jEA 


6074 


168 


1110 


PGARCMATKLQCPDSMPCHNQQVNSASTPSPEQLRPGDLILDHA 
GGNRASRAKVILLTGYAHSSLPAEI»DSGAC53GSSLNSEGNSGSG 
DSSSYDAPAGNSFI^DOSLSRQIGAQIdOJ^^ 
DKTASRGDFMFSADRLIRLVVEE^LNQLPYlOflCM^TPTGYKYK 
GVKFEKGNCGVSIMRSGEAMEQGLRDCCRSIRlGKILrQSDEET 
QRAKVYYAKFPPD I YRRKVLLM YP IIOTG \NTVIEAVKVI*I KHG 
VQPSVI ILLSLFSTPHGAKS 1 IQEFPKITIIjTTBVHPVAPTHFG 
QKYFGTD 


6075 


320 


1091 


P?TCtiP<iEVEHH\YGYVPII/?NKTLPSROTQCVIVSSSSHIXGT 
KLGPE IERABCT IRMNDAPTTGYSAW S VFR V 
IJUlPQEFVNRTPBTVFIFWGPPSKWKlWSbVRVIQRAGIjVFP 
HMBAYAVS PGRMRQFDDIiFRGETGKDRE KSHSMTLSTliW frTMVIA 
VELOTHVHVYGMVPPNYCSQRPRLQRMPYHYYEPKGPDE 
QNEHSRKGNHHRFITEKRVFSSWAQLYGITFSHPSifT 


6076 


1721 


107 


HPSPTEAPRVQBLTMDCTWRI I.FLVAAATGTHAQVQLVQSGAEV 
KKPGASVKVS CKVSG YTLTELS MHWVRQAPGKGLEWMGAFDPED 
GETI YAQKFQGR VTMTE DTS TDTAYMELS SURS BDTAVYYCATD 
HGDYAFDIWGC<^WTVSSAPTKAPDVFPI ISGCRHPKDNSPW 
LAC^IOTraPTSV\TVTWY14GTQSQA\QRTFPBIQRRDSxYM , rS 
S QLSTPTjQQWRQGBYKCVVQHTASKSKKK I FRWPES P KAQASS V 
PTAQPQAEGSLAi^TTAPATTRNTGRGGE EKKKEKE KEBQEERE 
TKTP EICPSH^XJPT^VYIjLTPAVQDLWLRDKATFTCPVVGSDLKD 
AHI,TWEVAGKVPTGGVEEGIJ,ERHSNG5QS0^SRLTLPRSLWNA 
GTSVTCTLNHPSLPPQRLMAI^PAAQAPVK^ sdppe 
Tk \ is r-Tiir t r t rnjor»T?Qt>D>JTr.T.MWT.RnHrjP!VN T P5?OPAPARPLiPKP\ 
RSrrPWA\WSVLRVPAPPSPQPATYTCVVSHEDSRTLLNASRSIi 
BVSYVTDHGPMK 


6077 


3687 


1268 


LLPDMNLQPI FWIGLISSVCCVFAQTDENRCLKANAKS CGECIQ 

AGPNCGWCTNS TFLQEGMPTS ARCDDIJ2ALKXKGCPPDD I ENPR 

GSKDIKKNKNVTNRSKCTAEKLKPFJ)ITQIQ 

QTPTLKFKRAEDYPI DLYYI^\ D3jS YSMKDDLENVKSIiGTDIiMN 

BMRRlT^FRIGEtKFVEKTVMPYISTTPAKEjRNPCTSBO^CTS 

PFSYiam^LTNKGEVF^LVGKO^ISGNLDSPEGGFnAIMQVA 

VCGSI»IGWRirVTRriLVFST]^GFHFAraX3KI^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location ■, 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, I°Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=>Asparagine, 
P^Proline, Q=Glutamine, R=Arginine, 
S*=Serine, TssThreonine, V^Valine, 
WsTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide Insertion) 








ENNMYTMSHYYDYPSIAHLVQKLSENNIQTIFAVTEKFQPVYKE 

LKNLI P KSAVGTIiS ANS3NVT QL I IDAYBS LSSEVT LENGKLSE 

GVTI3YQSY\CKNGVNGTGBWGRKCSNISIGDEVQFEISITSNK 

CPKXDSDSFKIRPLGFTBEVEVI LQYI CECECQSBG I PESPKCH 

EGNGrFBCGAC^CNEGRVGRHCECSTDEVNSEDIGCFTARKENQ 

FQKSAS1WGR VPSAGQCVCRKRDNTOE I YSGKFCECDNFNCDRS 

NGLICGGNGVCKCRVCECNPNYTGSACDCSLDTSTCEASNGQIC 

HGRGICEOGVaGCTDPKFXXKiTCEM^ 

F^GBICKDTCTQECSxTNITKVESRDKLPQP 

VDDC?fFYFTYSVNGNNEVMVHVTnE^ECPTGPDIlPIVAGW 

IVLlGIiAI»bLIHKIiLMIIHDRR^FAKFEKE 

KSAVTTVVNPKYEGK 


6078 


1426 


180 


BTEDVMELLEEDLTCP X CCSLFDDPRVLPCSHNFCKKCLEGILE 
GSVPJ^WRPVPFICCPTCRKKTFSYWELIPLQVNYSLKGIVEKY 
NKI XI S PKMP VCKGH \ LGQPIiNI P\ CIj \ TDMQLDIi/CG I C\ ATR 
GEHTKHVFCS IEDAYAQERDAFESLFQS FBTWRRGDALS RLDTL 
BTSKRKSLQbbTKDSDKVKEFFEKI^ 

LAVMQAYD P E INKLNT I LQEQRMA FN IAEAFKWSR PI VFLQQM 
QEFREKIKVIKBTPIJ?PSNLPASPLMKNFDTSQWEDIKI.VDVDK 
I£LPQDTGTFISiaPWSFYKLFIJ,IMJ^^^ 
FDDIATVOCCKTIiSNFSS YliTKTADFIEQSVFYWEQVTDGPFI FNB 
lU'KNFTT.VVLNNVAEFVCKYKLL 


6079 


1586 


141 


ATARDI^CARPvIDRWMESTPSRGLNRVHI^CRNIjQEFI^ 

GVLDRLYGH PATCIjAVFREIiPS IiAKNWVMRMLFLEQPLPQAAVA 

LH\ r KKEFSKAQEBSTGIiLSGLRIWHTQIjLPGGLQGLIIiNPTFRQ 

NIJlIAIiliGGGKAWSDDTSQr^PDKHARDVPSLDKYAEERWEVVL 

HFMVGS PSAAVSQDLAQIiLSQAGLMKSrEPGEPPCITSAGFQFL 

LLDTPAQI^YFMIOT-Cn'AQSRGMDLVEII^FLFQLSFSTLGKD 

YSVEGMSDSLLNFLQHIJtEFGIiVFt3PJa^RRYYPT/RAIAl^ 

SSGVSGAC^TVHQPGFIV\VE^TYRI,YAYTESEI^IAI.lAIiFSE 

MLYPFP\NMVV\ARVTR\ESVgQAIASGITAQQI rHFLRTRAHP 

VW^KQTPVLPPTITIXJIRJjWELERDRIiPvFTEGVLYNQFI^ 

ELL\LAHAPKXGVLVF^/in , PAKPJU1WT^ 

HSS 


6080 


1 


1199 


IETI DHVGEFAMAAQAAGVSRQRAATQGLGSNONALKYLGQDFK 

TLRC^K^DSGVI*FKEPB 

PTBLCPSPG^rVGGATRTOIOCXXJI^ 

YP^PRrKJDFQEKYAGIFnFQPLCPPS?\FWQYGEWVEVVlDDR 
bFTKNGQLU?LH3BQG3re 

EGFEDFTGGISEFYDLKKPPAI^YQIIRKALCAGSLLGCSIDVY 
SAAEAEAI TSQ KLVKSHAY S Vlt3 VEEWFX^HPBXLI PvLRNP WG 
EVEKSGAWSDDAPP^IOTIDPPJiKBEIJDK^VF^EFWMSI>SDFVR 
QFSRLEI CNLS PDSLSS BEVHKJTOJbVLFNGHWTRGSTAGGCQNY 
PGSS 


6081 


3 


865 


B^^Ia4I»PLPLLHA/GAIAQDARFRl ^ BMPESVTVQEGIJCI FVHC 

SVFYLEYGWmSTPAYGHWFREGVSVDQBTPVATNNSTQKVQKE 

TQGRFHLLGDPSRNNCSLS IRDARRRDKGS YFFWVARGR TKFS Y 

KYSPLS VYVTALTHRPDI LI PE FLKSGHPSNLTCSVPWVCEQGT 

PPIFSWWSUUVPTSI/SPRTLHSSVLTIIPRPQ^^ 

GAGVTTERTIQ^VSWKSGTVBBWVI^^ 

I LSFHKKKAVRAVEVEBNVYAVMG 


i;na'> 

tlUOA 


283 


1288 


fiARSFOPTQTRxAPvljAAPG 

PESVGQPEEASPEEQPEEASAEEERPEDQQEEEAAAAA\Y\LDE 

IiPEP1iLA/LRVI*AALPRHE\LV0ACR\I,VCLRWKELVDGM 

LKCMKGLVPEGGVEEERDHWQQFYFLSKRRRNI^RNPCGEBDL 

EGWa^HGGDGWRWELPGDSGVEFTHDESVKKYFASSFEWCR 

KAQra)UiAEGYWEBIJJ3TTQPAIW 

KIjLSEHBNVI^AEFSSGQVAVPQDSDGGGWMB I SHTFTOYGPG VR 
FVRFEHGGQDSVYW KG WFGARVTNSSVWVEP 


6083 


1865 


309 


KQWCAF^GLGMSI^BIJJUJLEEAAEEEEGGSYGBEEEEPAIE 
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SBQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing 3ignal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, F=Phenyl alanine, G= Glycine, 
H=Histidine, I=Isoleucine, Lysine, 
L=Leucine, M-Methionine , N=Asparagine , 
P«Proline, Q=Glut amine, R=Arginine, 
S-Serine, T=Threonine , V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=»Stop 
Codon, /=possible nucleotide deletion, 
\ -possible nucleotide insertion) 








DVQKETQIJ)LSGDSVKTIAKLWDSKMFAEIMMKIEBYISKQAKA 

SKVMGPVEAAPBYRVTVDANNLTVB IBNELNI IHKFTRDKYSKR 

FPBI^SLVTNAl^YIRTVKEIiGNSI^KCKNNEmQQIL'raATIM 

WSVTASTTQGQQLSEEELERLEEACDMAliRLNASKHRIYEYVK 

SRMSFIAPf^SIIIGASTAAKIMGVT^LTNLSKMPACNIMLLG 

AQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPIPPPPSVAP\DL 

RRXAARIiVAAKCTLAARVDSFHESTEGOT^ 

QEPPPVKQVKPLPAPUXK»KiatGGPJiYRKMKERLGIjTEIR\KQ 

ANRMS FGE IBEDAYQRDLGPS U5HLGK3GSGR VROTQVNEATKA 

RISKTLQRTLQKQSWYGGKSTIRDRSSGTAS SVAFTPLQGLEI 

VNPQAABKKVAEAKQKYFSSMAEFLKVKGEKSGLMST 


6084 


1865 


309 


KQWCAERRGLG^LADELIiADI£EAAEE£EGGSYGEERBBPAIB 
DVQEETQLDIjSGDSVKTIAKLWDSKMFABIMMKIEEYISKQAKA 

FPBLESLVPJU^YIRTVKEIiGNSLDKCKNNEm^QQILTNATIM 
WSVTASTTQG<X}I£EE£LERIJ3EA^ 

SRMS FIAPNLSI I IGASTAAKI MGVAGGJjTNIiSKMPAC!NT M LLG 

aqrktlsgfsstsvi^htgyiyhsdivqslppippppsvapXdl 
rrkaapxv7vakctiia7^vdsfhbstegkvgyel!0?eierkydkw 

QBPPPVKQVKPLPAPLDGQRKKRGGRRYRKMKKRLGLTBIR\KQ 
ANRMS FGEIBEDAYQKDI^FSLGHI^KiK^GRVROTOVNFJVTKA 
RISKTLQRTLQKQSWYGGKSTIRDRSSGTASSVAFTPLQGIJBI 
VNPC^VAEKRVARA^KYFSSMAEFLKVKGEKSGLMST 


6085 


2 


1456 


SGPRSFCK^RAVGRJSLGGKRKPBVTLI^VSSF^VRRWRRARV 
GVARVKPGNPWKPSPATQVPR/VPAQVYLPGRGPPLRHGKELVM 
DERAYVLYKRAQTGAPC!Ii3FT)TVRnRr>GrWRT^ 

KPQLBLAMVPHYGGINRWVSVnjGE 

RXLQVV3BPQALAAFLRDEQAQMKP I FS FAGHMGEGFALDWSPR 

VTGRIiLTGDCQKNIFILWTPTDGGSWHVDQRPPVGHTRSVEDLQW 

SPTEim/FASCSADASIRIWDIRAAPSXACMLTC^ 

ISWSRREPFLLSGGDDGALKIWDLRQFKSGSFVATFKQHVAPVT 

SVEWHPQDSGVFAASGADHQITQWDLG/IVERDPBAGDVKADPG 

I^LPQQIJ.FVHC^ETEIJCELHWHPQCPGI^VSTAl^GFTIFRT 

ISV 


6086 


2419 


1357 


GAATQHGG^^IJjPCNPHGNGLLYAGFKQDHGCFACG^JEKGFRV 

YNTDPI»KEKBKQEFI.EGGVGHVEMLFRCNYI^ 

NKVM I WDDIiKKXTVI E IEFSTE VKAVKIJ^\DKI VVVLD^ KV 

FTFTHNP \HQLHVFE \TCYNPKGIjCVI»CPNSNNSLIaAFPGTHTG 

HVQLVDIASTEKPPVDIPAHEGVl^CI ALNUJGTR IATASEKGT 

LIRIFDTS3GHLIQELRRGSQAANIYCINFNQDASLICVSSDHG 

TVHIFAAKDPKRNKQSSLASASBT^PKYFSSKWSFSKFQVPSGSP 

CICAFGTEPNAVXAI CADGSY YKFLFNPKGECIRDVYAQFLEMT 

DDKL 


6087 


476 


1B77 


QNSQRTGLPITIFSRS FPLI»TGSDIX^BIWPCTCTWRlfflRQWIRP 
LVAVI YLVS I VVAVPLCVWBLQKLEVG IHTKAWF IAGI FLLLT I 
PISLWVILQHLVHYTQPELQKPI IRILWMVPI YSLDSWIALKYP 
GIAIYVDTCaECYBAYVIYimfGFLTimiTNRYPNL 
OXJKHFPPLCCCPPWAMGBVIiFRCKI^VLQYTWRPFTTIVALI 
CELLGIYDRGNFSFSMAWTYLVI INNMSQIiFAMYCLLLFYKVLK 
EELSPIOPVGKFLCVKI,VVF^SFWQAWIALL\nCVGVISEKHTW 

ewotveavatglqdfiiciemfijvaiaXhhytpsykpyvqeaee 

G5CFI3SF7AMWDVSDIRDDISEQVPJIVGRTVRGHPRKKLFPEDQ 
IX2NF^Sni^3SSQDAISIASSMPP3PMGHYQGFGHTVTPQTTP 
TTAKISDBILSDTIGBKKEPSDKSVDS 


6088 


1684 


689 


GASGLVRJUL0X5GHROJAPVAPKXVPP\mGVKKGF 
LERQRUjRCP PPPI/RRBBKPKHDYHAEIQAFGHRLQBNFSLDLL 
KTAFVNSCYIKSEEAKRQQLG IEKEAVLLNI.KSNQEL5KOGTS F 
SQTCLTQFLEDKYPDMPTEGI KNLVDFLTGEEVVCRVARNLAVK 
QLTI^F^FPVPPAVUKJTFFAVIGALLQSSGPBRTALFIRDFL I 
TQMTGKKLFEMWKI INP«GLXVEELKKRNVSAPESRLTRQSG\A 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue o£ 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^AIanine, C«Cysteine, D-Aspartic Acid, E- 
Glutamic Acid, F«Phenylalanine , G=Glycine , 
H-Hiotidine, I=Isoleucine, K-Lysine , 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glutamine, R=Arginine , 
S=Serine, T^Threonine , V^Valine, 
H=Tryptophan, Y=Tyrosine, X=Uhknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PTALPLY FVGLYOTKKLIABGPGETVIjVAE B EAARVALRKLYGF 
TENRRP WNYS KPKETLRAEKS I TAS 


6089 


3 


3054 


TRLGIPGSTISSRPRLOU^AAEGHFI^HSWTGSRAGAHTGAPAW 
PSRRIjRDLPAGGMWRLRRAAVACKVCX}SIiVKHSSGI kgslplqk 
LHLVSRSIYHSHHPTIjKLQRPQLRTSFQQPSSLTNLPLRKLKFS 
P I KYGYQPRRNFtt PAPJLATPJLLKI^Y^ 

QWKDMI PDLSEYKWI VPDIWE IDE YIDFEKIRKALPSSEDLVK 
IAPDPDKIVESI^LLKDPFTSGSPEETAFRATDRGSE^DKHFRK 
VS D KEK I DQIvQ EELLHTQLKYQR I LER LK KENKKLR K L VLQ KDD 
KG I PPIESLRKSl^DMYSBVl^VIiSDYDASYimjtMII.PRVVWG 
DQS AGKTS VIiEMIAQAR I FPRGS G EMJfTRS P VKVTLSEGPHHVA 
LFIO)SSREPl)LTKEKDI^ALRHEIELRMRKNVKEGCrVSPETIS 
I^VftGPGIXJPJ^VDLPGVINTVTSGN^ 

DPNAI ILCIQDGSVDAERS IVTDLVSQMDPHGRRTIFVLTKVDL 

AEKNVASPSRIQQIIEGKLFPmALGYFAVVTGKGNSSBSIBAI 

REYEEEFFQNSKIiLKTSMIiKAEQVTTRNI^t^^ 

VEQXJADSFKATRFNIjETEWKNKYPRLRBIiD^ 

BVISI>3QVTPKlIWEEIL00SLWBRVSTIIVIKNIYLPAAQ*rMNSG 

TFNTTVDII^OWTDKQLPNKAVEVAWETl^BBFSRFMTEPKGK 

EHDDIFDKLKEAVKEESIKRIIKWNDPAEDSLRVIO^NAIJeDRSI 

SDKCK^nAAIYFMEEALQARLKDTEN^ 

KNRTQBQCVHNETKNELEKMIJCCNEEHPAYlASDEITTVRXNIiE 
SRGVBVDPS I>I KDTWHQVYRRHFLKTALN^^ 
FVDSKbECNDWLFWR I QRHLAJCTANTLRQQLTNTEVRRIiE KNV 
KEVIiEBFAEDGEKKI KLItTGKRVQIiAEDIiKKVRE I QEKLDAFI E 
ALHQBK 


6090 


194 


1560 


PVFVPAPGAVLEQAS/ASPPIATXJTVVPLQHCKIPBJLPVQAS IlT 
FKI/JliFTOLIAIjFVHYINIYKTVWWYPPSHPPSHTSIjmiblD 
FNIJjMVTTIVLGPJRFIGSIVKFJVSQ 

LTATGWSLCRSLIHLFRTYS FLNLL/FPI.LSVWDVHS VPAAELR 
P\RKTSLFNHMASMGPRBAVSGLAKSRDYIiLTI*R\RRGSSTQDS 
Q4ARTP CP/ PIIACCL^PSL I RSEVEFLKI^FNHRMKB VLVS SML 
SAYYVAFVPVWPVKinHYYDKRWSCEIJI^^ 
PASYCmiJTCAAAHI^CWQXVDPALCSNV^ 
LVKKSKNVYKAVGHYNVAI PSDVSH FRFHFFFSKPLRILNILLL 
LEGAVI VYQLYSLMSSEKWHQTISLALI LPSNYYAPFKLLRDRL 
VU5KAYSYSASPQRDU0HRFS 


6091 


3279 


412 


SSRTREMBEKEILRRQIRLLQGLIDDYKTLHGNAPAPGTPAASG 
WQPPTYHSGRAF8ARYPRPSRRGYSSHHGP3WRKKYSLVNRPPG 
PSDPPADHAVRPLHGARGGQPPVP<}QHVLKROV0I>S0GQtWI K 
VKPPS KSGSASASGAQRGS LEE FEDTPWSDQRPREGEGEPPRGQ 
LQPSRPTRARGTCS VEDPLLVCQKEPGKPRMVKS VGSVGDS PRE 
PRRTVSESVIAVKAS FPSSALPPRTGVAJ^RKDGSHSYASCAPQ 
LLGDRRVDAGHTDOPVPSGSVGGPARPASGPRQAREASLVVTCR 
TNKFRKN^TVnKWVAAS S KS PRVARRAI^PR VAAEtTVGKASAGMAN 
KVEKPQLIA0PEPKPRKPATS3KPGSAPSKYKNKASSPSASSSS 
SFRWQS FJW33KI)HASQI>S PVLSRB PSGD\RPAVGHSGLKPLSGB 
TPLSAYKVKSRTKI I RRRGSTS IiPGDKKSGTS PAATAKSHLS LR 
RRQALRGK5SWLKKTPNKGLVQVTTHRLCRLPPSRAHLPTKEA 
SS LHAVRTAPTSKVIKTRYRIVKKTPASPLSAPPFPLSLPSWRA 
RRI^I^RSLVLNRliRPVASGG^KAQPGSPWWRSKGYRCIGGVLY 
KV^SANKLSKTSGQPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 
RSLAI IRQARQRRBKRKKYCMYYMRFGRCNRGERCPYI1IDPEKV 
AVCTRFVRGTOCKTIXSTC^FSHHVSKEICMPVCSY 
CPYSHVYVSRKAF^C^DFI^GYCPLGAKCKKXlITrjiCPDFARRG 
ACPRGA)0<^LLHRTQKRHSRRAATS PAPGPSDATARS RVSASHG 
PRKPSASQRPTRQl'PSSAALTAAAVAAPPHCPGGSASPSSS KAS 
SSSSSSSSPPASIJDHFJ^SLQEAAIJkAAC^NRLCKLPSFISLQS 
SPSPGAQPRVRAPRAPLTKDSGKPLHIKPRL 


6092 


143 


3190 


AKAPPTGES S EPEAKVI»HTKRLYRAVVEAVinajDL»II^NKTAYQ 
BVFXPBNISIiRtfKLREIjCVKLMFI^^ 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine, th*Aspartic Acid, E- 
Glutamic Acid, F=Phenylalanine f G^Glycine, 
H=:Histidine, I=Isoleucine r K=Lysine, 
L*Leucine, M-Methionine , IteAsparagine, 
P=Proline, Q^Glut amine, R=Arginine, 
S=>Serine, T=Threonine, V=Valine, 
W»Tryptophan, Y^Tyrosine, X-Unknown, *«Stop 
Codon, /=possible nucleotide deletion, 
\=:possible nucleotide insertion) 








VlQr.IKTNKKHIHSRSTLECAyRTHLVAGlGFyQHLLbYIQSHr^ 
QLBI^CIDWTHVTDPI,IGCKKPV5ASGKEMDV^ 
LGDLSRYQNELAGVDTBIJJU^RFYYQAI^VAPQIGMPFNQKn^L 
AGS KYYNVEAMYCYLRC IQSEVS FEGAYGNLKRLYDKAAKMYHQ 
LKKCETRKLS PGKKRCKDI KRLLVNFMlTiQSLIiQPKSS S VDSBIj 
TSLCQS VLEDFNLCDFYTjPS S PNIiS LASEDEREYKSGYAFIjPDL 
LIFQMVIICXMCVHSLERAGSKQYSAAIAPTIALFSHLVNHVNI 
RLQAELKEGENPVPAFQSDGTDKPESKEPVEKEKEPDPKPPPVT 








PQVGEGRKSRKFSRLSCIiRRRRHPPKVGDDSDLSKGFBSDSSHD 

SARASEGSDSGSDKSIiEGGGTAFDABTDSEMNSQBSRSDtBDMB 

EEEGTRSPTLEPPRGRSBAPDSIJ«3PLGPSKASIASNI«QAMSTQ 

MFQTKRCFRIAPTFSKnJUl^PTTNPHTSASHRP^ 

PAS KEG S ES EGSES SGRSCRNERS I QBKLQVLMAKGLLPAVKVF 

LDWl^TNPDLIIVCAQSSQSLWNRI^V^ 

LCPEV(^L^GCEIiPDLPSSLLLPBDMALRNLPPLRAAHRRFNF 

DTDRPLLSTLEESVVRICCIRSFGHFIARLQGSILQFNPEVGIF 

VS I AQS EQBSIJ^AQAQFRmQEEARRNRLMRmAQIJU.QI.EV 

SQLFJ3SI^QPKAQSAMSPYLVPDTQAIiCHHLPVIRQIiATSGRFI 

VI I PRTV IDGLDLbKKBHPGARDGI RYLKAE F KKGNR YIRCQKE 

VGKSFERHKLKRQDADAWTI*YKIIiDSCKQLT\LAQGAGEEDPSG 

MVTIITGLPI^NPSIO^GPMQAALQAAAHASVDIKimJDFYKQW 

KK1G 


6093 


76 


1002 


ACXJRRAMIiAIiRVART/ SRWGAL\RGAVWAPGTRPSKRRACWAIiIi 
PPWCCLGCTiAERWRI^PAAI^IjRLPGIGQRNHCSGAGKAAPR\ 
PAAGAGAAABAPGWWGPASTPSLYENPWTIPNMLSMTRIGIAP 
VLGYLI IEBDFNIALGVFALAGLTDLLDGFIARNWANQRSALGS 
AU)PI^K^LISIIiYVSI,TYADI>IPVPLTYMIISRDVMLIAAVF 
YVRYRTL PTPRTIiAKYFNPCYATARLKPTFI SKVNTAVQli I LVA 
ASLAA P VFNYADS I YLQI LWCFTAFTTAASAYS YYHYGRKTVQV 
IKD 


6094 


23 


1010 


PFI^CI^GPQKAKMSERKVimYYPPDFDPSKIPKLKI>PKiJRU!< 
VVRLMAPFNMRCKTC^BYXYKGKKFNARKB^^ 
FYIKCTRCIJ\BITFKTDPE1« , DYTMBHGATRNFQAEKLLEEEEK 
RVQKEREDEEIiNNPMKVI*ENRTKDS KLEMEVLENLQEL KDLNQR 
QAHVDFEAMLRQHRLSEBERRRQQQBEDEQETAALLKEARKRR1, 
LEDSDSEDBAAPSPIX)PALRPMPTAlIiDEAPKPKRKVBVWEQSV 
GSLGSRPPI»SRLVVVKKAKADPDCSNGQPQA/ APHPRS PAEQEG 
GQPYTPDAWRVLPJEPTGCIPGQ 


6095 


1 


1599 


trgraajbrsrgrghgflgggfaSsvvdyfpsedfyrcgycknes 
gsrsngmwahsmtvqdyqdl idrgwrrsg kyvykp vmnqtccpq 
ytircrplqfqpskshkkvlkkmlkflakgevpkgsce\depmd 
stmddavagdfalinkldiqcdlktlsddi keslesegknskke 
bpqei»lqsqdfvgeklgsgepshs 



TRADOCS:1416257.!(%CSH01 f.DOC) 



445 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C= Cysteine, Daflspartic Acid, B= 
Glutamic Acid, F-Phenylalanine, G=Glyeine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N-Asparagine, 
P=Proline, Q=Glutamine f R~Arginine, 
S^Serine, T-Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, X^Unknovra, *^Stop 
Codon, /^possible nucleotide deletion, 
\ possible nucleotide insertion) 








VKyHTVPKPGKRADLSKPPCRKAKEIRKBRKRLKJbMQQNPAGEL 
BGFQAQGHPPSLFPPKAKSNQPKSLBDLIFKSLPENASHKLEVR 
VWSSPPSSQFKATLLKSYQVYKRYQMVIHKNPPDTPTESQFTR 
FX*CSS PLEAETP PNGPDCGYGS FHQQYWIiDGKT IAVGVIDIIjPN 
OTSSVYXYY1)PDYSFLSI£VYSALREIAFTRQLIIEKTSQI>SYYY 
MGFYIHSCPKMKYKGQYRPSDLLCPBTYVWVPIKQCLPST.KNSK 
YCRFNQDP EAVDEDRSTB PDRLQ VFHKRA I M PYGVYKKQQKDPS 
E BAAVLQ Y AS LVGQKCS ERMLLFRN 


6096 


2277 


S75 " 


QRVRiUUiSSi^HDSKAI^FKHMGlJ>PRIJ^VTDLGWSRPTLI 
QBKAI PLALBGKDLLARARTGSGECTAAYAI PMLQLLLHRKATGP 
VVEOAVRGLVLVPTKBLARQAQSMIOX3IiATYCARDVRVANVSAA 
BDSVSQRAVLMEKPDVWGTPSRILSHLQQDSLKLRDSLELLVV 
EEADLLFSFGFEEHLKSljLCmiPRIYQAFIJ^SATFNKDVQALKE 
LILHNPVTtiXMESQLPGPIXiLQQ^ 

LSLI RGKSLLFVNTLKRS YRI^RLFI^QFSIPTCVLNGKLPLRSR 
CHI I SQFNQGFYIX2VIATDAEVIX1APVKGKRRGRGPKGDKASDP 
EAGVARGIDFHR^SAVXNFDLPPTPEAYIHRAGRTARANNP^ 
LTPVLPTEQFHLGKIEELLSGKNRGP ILLPYQ PRMBE I EGFRYR 
CRDAMRSVTKQAI REARLKK I KRBLLJIS EKLKTYFEDN PR \ DLQ 
LLRHDLPI^PAVVKPHIjGEIVPDYLVP PALRGLVRPHKK\GRSCIj 
PLVGRPREQSPRTHCAASSTKKRKSDPQPSPPEWGP1.WS 


6097 


1673 


192 


APGTMSGGKKKSSFQITSVTTDYEGPGSPGASDPPT*PQPPTGPP 

PRXPKGEPSPDPGGKGTPRNGSPPPGAPSSRFRVVKLPHGLGEP 

YRRGROn"C^/DVYERDLE PH S FGGLLEG I RGASGGAGGRSLD3RL 

ELZVSLGIX5APTPPSGLS0GPTSWLRPPPTSPGPQARSFTGGLGQ 

LVVPSKAKABKPPLSASSPQQRPPEPETGESAGTSRAATPI*PSri 

RVEAEAGGSGARTPPI^RRKAVDMRLRMEIiGAFEEMGQVPPLDS 

RPSSPAIiYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAISGHL 

DSDDDSGSGSLVG IDNKIBOAJWDLVKSHLMFAYREEVEVLKEQI 

RELABRNEAALEQBNGI*LRAIA\SPEQLGSAGPPRGVPR^ 

PNGPFVLSU>SLTIVPI^LPGlASAAWPPLPMPALrVPVFPGVG 

VQALSNGPWSPGPLPHLLI IPSLDGGGEGFRTGRQQGAPFGEBT 

QPPPSLPGTPQQ 


6098 


168 


1074 


NYCUiHRSPLEKDSSPGSSSTSIJ^IKKQRETSDTPIMRALKELD 
EGKtPKHW3TQ^]CBDTSNII9PR^ 

RIiNSAS QRS S3LP PSNRKS ST P TKRE I MLTPVTVAYS PKRS P KK 
NIiSPGFS HLLS KNES»S PIRFD I LLTOIJXIVPVSTLQRTOPRKQIj 

\qflplddseekX tysekat\dni VNHSSCPEPVPNGVKKVSVR 
TAWEKKKS VS YEQCKPVSVTPO<5NDFEYTAKIRTLAETERFF\D 
ELTKEKDQI EAALSRMPS PGGR I TLQTRLNQEAFGRS FGKD 


6099 


168 


1074 


NYCLRHRSPLEKDSS PGSSS TSLL I KKQRETS DTP I MRALKKLD 
EGKI FKNWGTO^KEDTSNINPRQTETS VNASRSPEKCA^ 
RLNSASQRS SSLPPSNRKSSTPTKRBIMLTPVTVAYS PKRS PRE 
WLS PGFSRIujSKNESS PI RFDILIODLDTVPVSTLQRTNPRKQL 
XyriiViJiJLJaiisK. \ x ibJSWil \ U W X V NH 5 i> LLPK Pv PNG VKKVS VR 
TAN3K^KSVSYF^CKPVSVTPQGhroFEYTAKIRTLAETERFF\D 
BLTKEKDQIBAALSRMPSPGGRITLffnmiQEAFGRSFGKD 


6100 


2 


713 


FVEVSG YRSRADPEPRGRm MTYAYLFKY 1 1 1 GDTGVGKSCIiTjL 
QFTDKRFQPVHDLTIGVEFGARMVNIOGKQIKLQIWDTAGQESF 
RS ITRSY^GAAGaLLVYDITRRETFt^TSWLETMQHSSSNM 
VI ML X GNKS DLESRRDVKREEGEAFARE \HGL I FMETS AKTACN 
VEEAFINTAKE I YRKI QQGLFDVHNEANG I RIG PQQS I S TS VGP 
SASQRNSRD IGSNSGCC 


6101 


1 


1399 


FRGO^VTPLRj^SHWIiGCRRV^WSASV^RiPALSARi^ 

GKMVFPLSCAVQQYAWGKHGSNSEVARLL^SDPIAQI^ 

AKLWMGTHPRGDAKILDNRISQKTLSQWIAENQDSIi3SKVKDTF 
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SEQ 
ID 
NO i 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B=i 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine f I=lsoleucine, K=*Lysine, 
L=^eucine, M ^Methionine, N=Asparagine , 
P=?roline, Q=Glutamine, R=Arginine, 
S -Serine, T=»Threonine, V«Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, **=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KGNI^FLFKVLSVBTPLSIQAHPNKELAEKLHJjQAPQHYPDANH 
KPBJWIALTPPQGI^PRPVEEIVTFLKIOTBFQFLIGDEAATH 
LXQTMSHDSQAVASSZfQSCFSHXJIKSKKKV^ 

qaaagnnmedi pgklllqlhqqypgdigcpai yfkniiltkkpgk 

amflbanvpiiaylkgtktvecmacsdntvragltp 

mlsytpssskdrlflptrsqedpylsiydppvpdfttmka\evp 

G\SVTEYKDLALDSAS ILLMVQGTVIASTPTTaTPi plqrgGVL 
FIGANBSVSIiKLTBPKDIiIiIFRACCrji 


6102 


70 


2415 


QTPQ ATLAANGAKDSRGGEML PAGK IGAS PAAPCCS ES GDERKN 

LEEK5DINVTVLIGSKQVSEGTDNGDLPSYVSAPIEKBVGNDLK 

SLKKLDKLIEQRTVSKMQLEEQVLTISSEIPKRIRSAIjKNAEES 

KQPLNQFLEQETHLPSAIWSHLLTAQPWMDDLGTMISQIEEIBR 

HTjAYIiKWISQIEEIjSDNIQQYLMTNNVPBAASTLVSMABLiDIfaj 

QESSCTHLIfGFMRATVKFWHKIIjXDKIiTSDFEEIIJlQLHW 

PPQS<nVGLSRPASAPEiySYLBTLFOQU^KL(^HEI,LTEPK\ 

HSQKNTTiJ^PLLSS/WPIQWLTPIfl^^ 

KPt^Yl^QVLMWIGNHTEFI J DKKIOPII^KVGSLVNARI>EFCRG 

LMMLVI^KIjATO IPCLLYDDNLFCHLVDBVIjLF^PJBI^S VHGYP 

GTFASCMH1 LSEETCFQRWLTVERKFALQKMDSMLSSEAAWVSQ 

YKDIT^BMKVPDCAETFWLlAVITDRYKNLPTASRia^QFIvE 

LQI03LVDDFRJRI/It2VMKEETRASLGF^ 

DWADNVFTXQITOAALEVFAENNTLS KLQLGQLASMESS VFDDM 
XNLIjERIiKHDMIjTRQVDHV FREVKDAAKLYKKERWLSL PSQSEQ 
AVMSLSSSACPIJ^LTLRDHLLQLEQQLCFSLEKI FWQMLVEKLD 
VYIYQEIIIJUfRWEGGAAQIXJFDMTRNLPPLFSHYCKRPENYF 
KJIIKKAClVI^I^VGSALTAGKDVLPVQIiOGSFPAT 


6X03 


207 


2523 


ESNSTMTTYLBFIQQNEERDGVRFSWNVWPSSRIiEATRMVVPVA 
AL FTPLKERPDIiP P I QYE P VLCS RTTCRAVLN PLCQVDYRAKIiW 
ACN FCYQRNQFPPS YAGI S ELNQPAELLPQFS S I BY WliRGPQM 
PLI FIiYVVDTCMEDKDLQALKESMQMSLSiLPPTALVGLI TFGR 
MVQVHELGCEG I S KS YVFRGTKDLSAKQIiQBMLGLS KVP VTQAT 
RGPQVQQPPPSmFIJQPVQKIDMNLTDLLGELQRDPWPVPQGKR 
PLRSSGVALS IAVGLLECTFPNTGAR IMMF IGG PATQG PGMWG 
DELK^PIRSWHDIDKDNAKYVKKGTKHFKALANRAATTGHVIDI 
YACAIJDQTGIiLBHKGC PNLTGGYMVPKJDSFNTSIiFTCQTFQRVFT 
KnMHGQPKMGFGGTUS IKTPR\EI KISGAXGPCVSIjNSKGPCVS 
ElffiIGTGGTCX)WKICX^PTTTIAIiTEVVNQHNAPIPOGG\RG 
A\ IQ FVTQY \QHSSGQRR IRVTT I ARN\ WADAQTQ IQN LAAS FD 
0KAAAIIiM7VRLArYRABTEEGPDVLRWLDRQTjIRIX}QlG?GKYlIK 
DDPSSFRFSETFSLYPQFMFHLRRSSFLQVFNNSPDESSYYRHH 
FMRQDLTQSLIMIQPILYAYSFSGPPBPVLLDSSSILADRILLM 
DTFFQIL I YHGET IAQWRKSGYQDMPE YBNFRHXiIjQAPVDDAQE 
IIiHSRFPMPRY IDTEHGGSQARFIJ^KVNr^^ 
GAPILTDDV5LQVFMDHLKKLAVSSAA 


6104 


124 


732 


KVSBYI I LSKI)KI LFHALAMLVLWSPNSAARGVLRNYWERLIiR 
KI^SRPGFPSPPWGPAIAVQ\^PCLQSQQMIPVEVKRI/RSL 
LDS I FWMAAPKNRRTIEVMRCRRRNPQKH KVKNNI DVCPECGH 
LKQKH^CAYCYEKVOCETAEIRRQIGKQEGGPFKAPTIETVVL 
YTGETPSEQDQGKRI I ERDRKRPSWFTQN 


6105 


3 


9B9 


plhgactslvlojlfchrrprpcaparpfj)mrrpaavplllllef 
gsorakaatacgrprmlnrmvggqdtqegewpwqvs iqrngshf 
cggsl i aeqwvltaahcfrnts btsl yqvij/5arqlvqpgpham 
yarvrqvesnpi»yo^assadvalvei^apvpftnyilpvclpd 
psvifetgmncwvtgwgspseedllpepri^ioavpiidtXpr 

CmJjYSKDTEEGYQPKTlKNPMLC7VGFEEGKKDACKGI>SAGPLV 

clvgoswloagviswgp^carqnrpgvyirvtahhnwihriipk 
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SEQ 

ID 
NO; 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

;>mi nn arid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, B=> 
Glutamic Acid, F=Phenyl alanine r 0=Glycine f 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N==Asparagine , 
P=Proline / Q^Glut amine, R=Arginins, 
S -Serine, T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possjU>le nucleotide deletion, 
\=possible nucleotide insertion) 








LQVQPSEVGRPBVTPPGPGAP 


6106 


3 


1302 


GRP PTAPHTGRP PTANRGD PRLDLKRGCARLLTS IES RGRPAAS 
AGLRRDRCALRKWPLRRAPIARATRRRAGSPRRCAPRPRACPQG 
WSRARHQPGGLCLIJ^IJiCQFMBD^^ 
! IjYKTEIjSKEECCSTGR LSTSWTBED VNDWTLFKWMI FNGGAPNC 
I PCKKTCJ^DC^PGKKCRMNKKNKPRCVCAPDCSNITWKGPVC 
GLDGKTYRHBCALLR7ARCKBQPBLB 
STCV\VD(niniAYCOTaniICPBPASSEQYIiC(^ 
HLRKATCLLGRS IGIAYEGKC I KA3CS CEDI QCTGGKKCLWDFKV 
GRGRCSLCDELCPDSKSDEPVCASDMATYASBCAMKEAACSSGV 
LLKVXHSGSCNSrSEDTEEREEDKUQDYSPPISSILEW 


6107 


623 


168 


SRCSSPRPBPGRGRGK/I^PSKHRKWVEVFKACDEDHKGYLSRE 
DFKTAWMLFG Y3CPSKI EVDS VMS S INPNTSG I LLEGFLNI VRK 
KKEAQR YRNBVRH I FTAFDTYYRGFIjTLEDFKKAFRQVAPKIjPE 
RTVLEVFREV\DRDS\DGHVSF 


6108 


3 


1348 


GGSLRFSPPRVPS.CSRVFCPVPPGGCGbPSPPiSASRPQSPTTPW 

C1PRRYMKHKRDDGPBKQBDKAVDVTPVMTCVFVVMCCSMLVIX 

YYFYDLLVYWIGI FCLASATCIj YSCTAPCVRRLp \SASAGES A ! 

IJ^TIPNNSLPYFHiaiPQARMIXIJU^FCV^^ 

WAWVI£DAIX3IAFCLYMIJCTIIU,PTFK^^ 

FITPFIjTKSGSS IIWEVATGPSDSATRKKLPMVLKVPRIJlSSPr, 

AbCT)RPFSIJiGFGDIIiVPOIiLVAYCHRPDIQVQSSRVYFVACTI 

AYGVGIiLVTFVALALMQRGQPALIiYLVPCrXiVTSaVVALWRREL 

GVFWTGSGPAKVLPPSPWAPAPADGPQPPKDSATPLSPQPPSBE 

PATSPWPABQSPKSRTSKKMGAGAPMREPGSPAESEGRDQAQPS 

PVTQPGASA 


6109 


1 


1381 


CRSRAGAASGGAILEGTKLRRQRVDTNKPLDPLVPSALRAAMLY 
LEDYLBMI EQLPMDLRDRFTEMREMDljQV(2^ftMDQLJBQRVSEFF 
MNAKKNKPEWRBEQMAS I XKD Y YKALEDADEKVQLANQI YDLVD 
RHIJlKLDQBIAKFKMKLEADNAGITBIIiERRSLELDTPSQPV^ 
HHAHSHTPVEKRKYNPTSHHTTTDHIPEIOCFKSEALIjSTLTS^ 
SKElHTuGCRNNNSTASSNNAYNVNSSQPLGSYNIGSLSSGTGAG 
GI \TMAAAQAVOATAQMKBGRRTSSI>KASYFAFKimDFQLGKEF 
SHARETVGYSSSSALMTTLTQNASSSAADSRSGRKSKNNNKSSS 
QQSSSSSSSSSI^SGSSSSTVVQBISQQTTVVPESDaNSaVDWT 
YDPNEPRYCICNQVSYCrEMVGCiniODCPIEWFHYGCVGLTBAPK 
GKWYCPQCT\AAMKRRGSRHK 


6110 


77 


2464 


ACPSAATM S DODHSMDEMTAWKI EKGVGGNNGGNGNGGGAFS Q 
ARSSSTGSSSSTGGGGQESQPSPLAT.TAATCSRIESPNENSNNS 
QGPSQSGGTGELDLTATQLSQGANGWQI ISSSSGATPTSKEQSG 
SSTNGSNGSKSSKNRTVSGGQYVVAAAPNIjQNQQVIjTGLPGVMP 
HI QYQV1 PQFQTVDGQQLQFAATGAQVQQDGSGQIQX I PQANQQ 

I ITNRGSGGNI iaampnu^avplqglannvi^gqtqyvtnvp 

valngnitllpvnsvsaati.tpssqavtisssgsqbsgsqpvts 

gttissaslvssqassssfftna»systttt^^ 

ssgtwsqgqtpqrvsglqgsdalniqqnqtsggslqagqqkbge 

q\hqqtqaapkst^rpqlvqgg\qalx2\afqaapi^g<rrfttqa 

I SQETLQNLQLQAVPNSGP I 1 IRTPTVGPNGQVSWQTLQLQHLQ 
VQNPQAQT I TLAPMQGVSLGQTS 8 SNTTLTPIASAAS I PAGTVT 
VNAAQI*SSMPGI^TINI^AIjGT3GIQVHPIQGLPIiAIANAPGDH 
GAQLGT^GAGGTOIHDDTAGGEEGENSPDAQPQAGRRTRRFACT 
CPYCKDSEGRGSGDPGKK1CQHI CHIQGCGKVYGKTSHIiRAHLRVJ 
HTGERPFMCTMSYCGKRFTRSDELQRHKRTHTGEKKFACPECPK 
RFT1RSDHLSKIIIKTHQNKKGGPGVALSVGTLPLDSGAGSEGSGT 
ATPSALITTNMVAMFAICPEGIARLANSGINVKEGGQFCSPINT 
SANGF 
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ID 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C= Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine, 
IiaLeucine, ^Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R-Arginine, 
SaSerire, T=Threonine , V=Valine, 
W=Tryptophan, Y-Tyrosine , X- Unknown, +«stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6111 


1637 


797 


RVDPRVRGAMAP WGKRIiAGVRGVIiLDI SGVLYDSGAGGGTA IAG 
SVBAVARLKRSRIiKVRFCTNESQKSRAKLVGQLQRLGFDISEQE 
WAPAPAACQILKE^GIiRPYIiIHDGV\ASEFIX3TDTS/STPNC 
WIADAGESFSYQN^5NHAPQVI2^LEKPVLISIiGKERyYKBTSG 
LMIiDVGPYMBUMiEYACXS I KAEVGGKPSPEPFKS ALQAIGVEAI *Q 
AVMIGDDIVGDVGGAQRa^RAI^VRTGKPRPSDSHHPBVKADG 
YVDNLAEAVDIiLLQflADK 


6112 


77 


196 


MSSHKSFKSKRFLAKKQKPNRPII^IWIjercWKIRfmWK 


6113 


1779 


567 


WBGRSWAACGVWIiQGAWGKRSGVRASEAESPGKRADVSWWSRQL 

TKECRKKAKPRIFHIjFNDILVYGSIVIiNKRKYRSQHIIPLEEVT 
LKIiLP ETLQAKNRWM I KTAKKS PWSAASATERQEW I SHI EECV 
PJ*QI^TGRPA\STEJJAAPWIPDKATDICMRCTQTR^ 
HCRKCRVWCAECSRQRFIiLPRLS PKPVRVCSLCYRELAAQQRK 
EEABEQGAGVPRAASHLARP I CX3R PVEMTMTPTRTRRAAGTATG 
PAAWSSTPRGWPGLPSTADPRPAEHLSPSQLHCPGPQEGSSRSC 
PGI.RDP I PWKQVQRWGVA1AGUPVPFCWTLCPYGFTAGNAFPFR 
KPQNTHRSW 


6114 


816 


246 


PTSRPRPSPG^PAMSWSACVSAAPSSSWPASSSWPCJGPRRCCTR 
RMlCSPRCttIAAGSMCSCSPSHRCTPVPACWPSPPP\PABQVlQC 
GHLPPHADRRAJjRIjPVAAP ARG PGPGHPAGPAGPRPARTPPAS P 
HQPGRPWPAPPCPLLAATEPTPSRPHQRWTREDRI'ILGRGSQVT 
GRPQWFLRjGLVIiFSL 


6115 


324 


71 


DVCGRVCAHPHbYTH IHMH 1 CAHAC \ I HTHAQLC/ ITASHALAH 
SHLYTCMV^TASHTPSHTHPHTAVHKEHRADVLRGTLTPLR 


6116 


595 


1430 


TGVMPPGRWHAA/ ISSSGPVFEGARA\LQTVKKEEEDE3YTPVQ 
AARPQTI^RP^ElFRQLPROLRYHBSSGPLETI^RLRBLCRWtV 
LRPDVLSKAQILELLVLEQFL3 ILPGEbRVWVQLHNPESGEE\L 
N PCWRS CRGTLMGHPGGTRAL P \ E PRCALDGYRS \ LRSAQI WS I» 
AS PLRSSS AIjGDHIiEP PYEI EARDFLAGQSDTPAAQM PALFPRE 
GClPGDQVTPTRSLTAQIOBlTflTFKDVEVT FSQDEWGVfLDSAQRN 
LYRDVMLKNYRNMASLGK 


6117 


1433 


222 ■ 


VGVPSPAPPCSWBVGPGGGWTPGirJCEGOWRRTPIirjATRTR 
GLLSL FPP AAMHPAAFPIiPVVVAAVLWGAAPTRGIj IRATSDHNA 
SMDFADLPAliFGATLS QEGLQOFLVBAHPDHACSP IAP? PPAPV 
NGSVFIALLRRFIX^FDLKVLNAQKAGYGAAVVHN^ 
VWN5EBIQQQIWI PS VFIGERS SEYt»RALFVYEKGAR VLr,VPDN 
TFPLGYYI*lPFTGIVGljIiiVI»RMG^VMIARCIQHRKR^ 
\EQIJCQl\PTmJYQKGr<lYDVCAICLDBYEDGDKIiRVL^ 
RERCVDP^TQTRKTCPICKQPVHRGPGDKIXJEEBTQGQEEGDH 
GEPRDHPASERTPLLGSS PTLPTS FGSLAPAPLVFPGPSTDPPL 
SPPSSPVJLV 


6118 


1044 


247 


STI3CRACT3GATPGAOSHRSARGKJyU5GKETAALGMERGKVia<: 
KEKEKETQKESIGE KGREEKVKRKEVBQKI KQBKQE KQBRRKGK 
EKEEKRTKQGKETNKP. KEQFKGQEE5Q3ENKDSTL.TRTP1>EP LEK 
NKQI OVLGIJDGAGKTSVLHSU^NRVOJISVAPTQGFHAVCINTE 
DSQMEFLB IGGS KPFRS YWEMYLSN/ ADSLARSFSVGFKQDSQP 
ITWKAKKYLHQL IAANPVLPLVVFANKQDLEAAYH ITD IHEALA 
II 


6119 


1217 


462 


DPRFVTENTTKAPAQERTTQPRSSREGTM^ 

LRSVSN ISSBPGRRA/WTSAPPPQRPFRVCDHKRTIRKGLTAATR 

QKUAKATjKTLIiNG^TLVUBEIXJTAVDSBDFFQI^EDDTCI^ 

VI/lSGQSWSPTRSGVI^YGLGRBRPKHSra 

DLFGSLNVKATFYt&»YSMSCDFQGlAGPK^^ 

GLGHMLIG ISSTIjRHAVEGAEQWQQKGRLHSY 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A^ Alanine, OCysteine, D=Aspartic Acid. K= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H^Histidine, I=Isoleucine , KtLysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=?Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine f X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6120 


705 


179 


LERAGGGGLSSRALVGSGACI»SLVARANGKGLPRGRKEFVEAVR 

VR YVAER YRTPRAVCLRLWS CRRE VTMSGRGKQGGKVRAKAKSR 

SSRAGLQFPVGRVHRLLRJCGNYAERVGAGAPVYLA^ 

I LELAGNAARDNKKTR 1 1 PRHLQLAI RNDEBLNKLLGKVTI AQG 

G\VLPNIQAVLLPKKTKSQKDBOANDP 


6121 


1612 


107 


FVRAQARGSRQPVRRPIJjGAGSRLRCRS CGRMEPLKVEKFATAN 

RGNGLJ»VTPLRPGB1^FRSDPLAYTVCKGSRGWCDRCS»IX3KB 

KlMRCSQCRVAKYCSAKCQKKAWPDHKRBCKCIiKSCKPRYPPDS 

VRLLGR VVFKLMDGAPS ESEKL YS F YDLESN INKLTEDKKEGLR 

QLVMTFQHFMRBEIQDASQLPPAFDLFEAFAiCVICNS PTICNAE 

MQEVGVGIiYPS ISLLNHSCDFNCSIVFNGPIILLLRAVRDIEVGE 

ELT I CYLDMLMTSERRRKQLRDQ YCFECD \ CFRCQTQDKDADML 

TGDEQVWKEVQESIjKKIEEIjK7QiWKWBQVIiAMOQAIISSNSBRL 

PDINIYQLKVtUXZiMDACINI^I^^ 

SHPVRGVQVMKVGKl^LHQGMFPQAMKNIJl 

SLX EDTiI I>L LE/ AMRRQHQS ILRERSQREI RRVSLLNALLRSHT 

w:fvscvnlsywkfcsvfv 


6122 


2 


2324 


RFRKMAIXMaASQDESSAAAAAAADSRMNNPSETSKPSMESGDG 

NTGTGTNGU5FQKQPVPVGGAISTAQAOAF^ 

07VAAQSLNVQ3KSNF^GDSOX3PSQPSQQPSVXJAAJPQTQLMIA 

GGQ I TOLTLTPAOWT J J^AOAOAOTiLAAAVQQHSASQQHSAA 

GAT I SASAATPMTX3 IPLSQ PIQI AQDLQQI^QLQQQNLNLQQFV 

LVHPTT^^I^PA\0FII3OT^0G0OGIJLQA\OOTJ,TnT,Pnn^na^»J 

LLQSQPRI\TI.TSQPATPTCTIAATPIQ^PQSQSTPKRIDTPS 

I^P\SDLEELE0FAICrFKX?miIKLGFT\CX3DAGLAMVKLYGND 

FSPTTIFRFBAIjNLSFKNMC^KJ^^ 

PSALNSPGIEGLSRRRKKRTSIEA\NIRVALEKSFLEN\QKPTS 

EE3TMIADQI*NMEKGVIRVWFCNRRQKEKRINPPSSGG\TSSSP 

IKA1 FPS PTSLVATTPSLVTSSAATTLTVSPVLPL.TSAAVTNLS 

VTX7TSim , Sl!^AWISTAPPASSAVTSPSI>SPSPSASASTSEA 

SSASETSTTQTTSTPI^SPLGTSQVMVTASGIK3TA/A0LLPFKG 

AAQI.PANASIJUVMAAAAGLNPSLMAPSQF^ 

GAI^PAI>1SNSTIATI0AIASGGSLP ITSLIIATGNLVFANAGGA 

PNIVTAPIJPLNPQNI^IXTSNPVSLVSAAAASAGNSAPVASIiHA 

TSTSAE^IONSLFTVASASGAASTTTTASKAQ 


6123 


3 


2944 


HLIiHRWFGTDKG^lINFTTGE FQIiTEAC P YLGTHS EES RFGI LHL» 
HLQPLFJ^KRVGVVFTPADYGKVTSIilLIPJ^TVII^GVl^B^ 
ARELIjKVGGRLPGAGGSl^FKVPBSTIJdDCRROLKDSKQII^ 
KNFKVENIGPLPITVSSLK^NGYNCO^GFEV^DCHQFSLDPNT 
SRDIS IVFTPDFTSSIWIRDLSLVTAADIiEFRFTI^JVTIiPHHLL 
PL(^WPGPSWBESFWPJ^TVFFVSI*SI*LGVlI>lAFCmOYII^ 
EFMKTRQRO^^SSSQQIWGPMDVISPHSYKSNCKNFLDTYGPS 
DKGRGKNCLPVNTPQSR iqnaakrs patyghsqkkhkcs vyys k 
HXTSTAAASSTSTTTEEKOTSPLGSSLPAAKEDICTDAMRBNWI 
S LRYASG INVNliQIOiLTLPKNLLNKEENTLKNTIVFSNPSSBCS 
MKEGIG^CMFPKETOrKTSENTAEFKEJtRLCPLKTSKXLPBNHL 
PRNSPQYHQPDLPE ISRKNIK3NNQX}^ 

PSSEKKIHKTSREDMFSEKQDI PFVRQFJ^PYRKXKLQEKREGNL 
QNI^WSKSRTQU(NKKRGVAPVSRPPEQSt)LKLVCSDFER5E^ 
SDINVR5WCIQKSTREVC1CADAEIASSLPAAQREAEGYYQKPEK 
KCVDKFCSDSSSDCGSSSGSVRASRGSWGSWSSTSSSDGDKKPM 
VDAQHFLPAGDSVSQNDFPSEAPISmLSHNICNPMTGWSLPQY 
AEPSCPSLPAGPTGVF^KGljYSPGDIjWPTPPVCVTSSI^CrrLE 

ngvpcviqesapvhnsfidwsatcegqfssaycplblhdynapp 
kemm^angfpcpadvotdfidhnsqstwntpp\nmpas \wgna 

QFPSSSRPYLKSTPKACLPMSGLFGPI\WAP\QSDVYBNCCPIN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D~Aspartic Acid, E= 
Glutamic Acid, F- Phenylalanine , G^Glycine, 
H=Histidine, I=Isoleucine , K^Lysine, 
I>=Leucine, M~Methionine, M=Asparagine , 
P^Proline, Q-Glut amine, R=Arginine , 
S=Serine, T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyxosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\apo3sible nucleotide insertion) 








PTTEHSD /THMBNQA\WCKEYYPGF \NP FRAYMNliDIWTTT\A 
NRNANFPLSRDSSYCGNV 


6124 


1S73 


236 


SDEALHLAGERGMGRVQLFE I S LSHC3RVWSPGE PJLAGT VR VRL 
GAPLP FRAIRVTC IGSCGVSNKANDTAWVVEKGYFNS S IjSIADK 
GSLPAGEHSFPFQFIJj^ATAPTSFEGPFGICEVHQVRAAIHTPRF 
S KDHKCSLVFYILSPLNLNS I PDIEQ PNVASATKKFSYKLVKTG 
S WLTAS TDLRGYVVGQALQLHADVBNQ5GKDTSP VVASLI^ 
C WTCJVKIIW THTWPT T AFVEGAfSVTCAWRRAOW HEOHjVPAL»POSAI» 
PGCSLIHIDYYLQVSIiKAPFJVTVTLPVFIGNIAV/NPCPSBPPA 
RPGAASWGPTPGG\PSAPPQEEABAEAAAG6PHFLDPVFIjSTKS 
HSQRQPLL&TLSS VPGAPEPCPQDGS PASHPLHPPLCISTGATV 
PYFAEGSGGPVPTTSTLILPPEYSSWGYPYEAPPSYEQSCGGVB 
PSLTPBS 


6125 


1 


904 


KTCP KLTCAFTVS VP DS CCRVCRGDGELS WEHSDGD I FRQPANR 

KAKits i HKo Ji i Dec tr oKV2ouurljf>K r i^j»KonittsHl*U/oyyiwv» X 

IVQIVTNNKHKHGQVCVSNGKTYSHGESWHPNLRAFO I VECVLC 
TamTCUKCKKlHCPNRYPCXYPQK^^ 

PGQS FIMKGYFCCKETMPVYES VFMEIXJETTRKIALETERPPQV 

BVHVWTnUCGIWJHFHIBKISKR^F^ 

FTEGEAQISQMCSSRVCSTOLBDLVKVLYLBRSEKGHC 


6126 


1224 


389 


RIJjSEAPCPRSRRRFQMNPEWGOAFVHVAVAGGLCAVAVFTGIF 

■ i>*v?> »yM»' i^ftf n f\ivMfr»vj\i.Ti.>> , *\r iiHWfrwanvHPifti 1 jLtXjvvi-u n u 

HRGGAMGIK?PRYLKDVFAAMALI*YGP^MUILWTQWRRAAVI^ 
WLTLP I FAW P VAWCXYLDRGWRP \ WLFLSLECVSLAS YGIiALLH 
PQGFEVALGAHWPAVGOALRT \HRHYG/SATPSATYLALGVI*S 
CI/3FV\n^CDHQIJmWRLPQCXTGH^ 
THFNTHPRFHPSGGKTR 


6127 


1335 


463 


VLPRRCLVFVVNTM DS SR E PTLGRIiDAAG FWQ VWQR FUADEKG Y 
lEEKELDAFFIiHMUtia.GTDDri^ 
GRIRMKELAGMFLSFJIENFIjI^FRRENPLDSSVKFMQ^ 
0SSGF1SAA^IiRNFLRDI»FLHHKKAXSBAKI»EBYTGTMMKX FDR 
NKIX3R I^LNDIJmilJWENFLLQFKMDACSTB KRKGDFEKI FA 
YYDVS KTCALEGP \EVXX5FVKDMr-tELVQPS ISGVDLDKFRE I LL 
RHCDVNKDGKIQKSKLALCLGLKINP 


DJL/O 


2*511 


843 


tcrmsrrqi^wvwssox?vqargrnvraprlgkiamglemsskd 

SPGSI»IX3RAWBDAQKPQSAWCG*3RKr^^ 

GGAARPKKTABBGPPAAPGSLRHSGPLGPRAGPTAIiPKPQVTSA 

MSSQWG IEPLYI KAEPAS PDSPKGS5BTETEPPVAUVPG\ PAP 

TRCLPGHKJffiEIXJEGAGPGEOjGGGKLVLSS 

GYHYGVASCEACIGVFFiaiTIQ^SIBYSCPASireCTITKRRRKAC 

QACRFTKjCI^VGMLKEGVRIjDRVRGGRQKYKRRPETO 

FPAGPIAVAGGPRKTAAPVNALVSHLLVVEPBKLYAMPDPAGPD 

GHLP AVATLCDLFDREIVVT I SWAKS I PGFSSLSLSDQMS VLQS 

VHMEVLVLGVAQRSLTLQDELAFAEYIiVU3EEGARPAGLGELG\ 

AALLQLVRRIiG^LRIiFJre^ EDEPRLWS 

SCKKr,T 1 HKArJ.EYKAGRAGPGGGAERRRAGRLLLTI.PI,r.RQTAG 

KVIiAHFYGVKLPXJKVPMHKXPLBNCLEAMMD 


6129 


1764 


771 


ARFARSAHEGKMPKKKTGARKKAENRRERKKQLRASRSTIDIiAK 
HPOTAS^CDKCQRRQKNPAFCYPCNSV^ 
KSSDCV1KHAGVYSTGLAMVGAICDFCEAWVCHG 
CPLTDAEC\VECERGVWDHGGRI FSCSFCHNFLCEDDQFEHQAS 
CQVTiEAET P KCVS CNRLGQHS CLR CKACFCDDHTRS KVFKQEKG 
KQPPOTKCGHETQETKDLSMSTRSU^^ 
YWKNLSSDKTCDTSYHDEBEDBYEAEI)DBBBE13BGRKDSDTESS 
DLFTNLNLGRTYASGYAHYEEQBN 


6130 


3 


577 


GRGGTMREYKVVVLGSG\GVGKSALW\QFVTCTF1EKYDPTIE 
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3EQ 
ID 

VtKJZ 


Predicted 
beginning 
nucleDt ide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
Sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, 6«Glycine, 
H=»Histidine, 3>Isoleucine , K-Lysine, 
L«Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutaraine , R=Arginine, 
S=Serine, T= Threonine, V= Valine, 
W=Tryp t ophan , Y=Tyrosine, X=Unknown , *=Stop 
Codbn, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








dfyrkbiev\dsspsvagiswtqqgteqf\asmrdi,yikkgqgc 
ilvyslvnqqspq\dikpmrdqirrvkvsekvpvt\i,vgn\svd 
les erev3ssegralaee wgcpfmktsaxsktmvdelfae i vrq 
mnyaaqpdkddpccsacniq 


6131 


3 


j 1811 


SSPREKTSDSSHRPSRHGPLFLRLVGLSPFSYLCVPPSRPVPGS 
PRSLSAMRLLPLAPGRLRRGSPRHLPS CS PALLLLVLGGCIjGVF 
GVAAGTRRPNVVIiIJ,TDIX)DEVLGGMTPLK^ 
SAYVPSALCCPSRAS ILTGKYPHNHHVVNNTLEGNCSS KSWQKI 
QE PNTFPAI IjRSMCGYO/TFF\AGKYLNEYGAPDAGGLEHVPLGW 
SYVTVAI*KKNSKYYNYTIiSIKGKARKHGEHYSVDYLTDVIJU^ 
DFliDYKSNFEPFP*MATP\APHSPWTAAPQYQKAFQNVPAPRN 
KNFNIHGTNKHWLIRQAKTPMTNS S IQFLDNAFRKRWQTLLSVD 
DLVEKLVKRLEFTGELNNTYI FYTSDNfG YHTGQFSIiP I DKRQLY 
EFDIKWPLLVRGPGIKPNQTSKMbVANXDLGPTILDlAGYDIjNK 
TQMDGMSLLP ILRGASNIjTWRSDVLVEYQGEGRNVTD PTCPSLS 
PGVSQCFPDCVCSDAYNNTYACVRTMSALW^ 
B\nfI^I3tf>PDQITNIAKTIDPE^ 
PGVFDPGYRFDPRLMFSNRGSVRTRRPSKHLL 


6132 


96 


1241 


aagllppglvpedprrtrnllpfgiqgppfalsrplfscvesgw 
awfambpeflydli^lpkgveppaeeblskggkkkylpptsrkd 
pkfbelqkpa\vuibwinatijipbhrvvr5lkedmp15glrxh^ 

FQRIAALKIiEAEDIALTATSQKHKLTVVLBAVNRS\C6WRSGRP 
SGA/WESlFNKDLLSTLHLLVALAKRFQPDLSLPThTVQVEVITI 
ESTKSGLKSEKI»VSQI»TEYSTDKDEPPia)VPl)BLFKLAPBKVNA 
VKKAI Vl^VNQKI^RI^l^ VQNIJ5TQFADGVI LLIJ^IGQLEGFF 
IJn*KEF^TPNSPAEMI,HNVTLALELL/IGRGPAQLPC/LALK/ 
TTVNKlJAKSTUiVIiYGLFCKHTQKAHRDRTPHGAPN 


6133 


2 


4256 


FVHGSMADTDIiFMBCEBEELBPWQKI SDVIBDS WEDYNS VDKT 

TTVSVSOXJPVSAPVPIAAHASVAGHI^TSTTVSSSGAQNSDSTK 

KTLVTLlANNNAGNPLVQQGGX)PLILTONpAPGliGTMV^ 

PVO^QHANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVG 

rWWG^XKJTWPITLVPAPGTQFWPTVGVPQVFSQMTPV^ 

STMP VRPTTNTFTTVI PATI/T IRSTVPQSQSQQTKSTPSTSTTP 

TA0X}PTSU^IJWQSPGQ^OTTNPKLAPSFPSPPAVSIASFV^ 

VKRPGVTGENSNEVAJG^VNTrJ*TIPSLGQSPGPVVVSNNS3AH\ 

GSOJRTSGPESSMKVTSSIPVFDLQIX^RIQOTRCNAQFRVTKAI* 

RGHMCYOZPEMVEYQKKGKSlJJSBPSVPSAAKPPSPEIcrAPVAS 

/ THPSSTPIPALSPPY/TKVPE PNENVGDAVQTKLIMLVDDFYY 

GRIX^KVAQLTNFPKVATSFRCraC^ 

IJJQQNGEVDGHTICQHCYRQFST PFQI^HLENVHS PYES'lTKC 
KI CBWAFES EPLFLQHMKDTHKPGEMP YVCQV^ 
VHFRMIHEDTRHIJX:PYC!I,KVFKNGNAFQQHYMRHQKR\W 
CNKCKVQPLFAKDKIBHKI^HHXTFRKPKOLEGLKPGTKVTIRA 
SRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFIiYPPVQRS 

CCSRAYANHMINI^PRKSPKTIiAIiFKNSVSGIKIiACTSCTFVT 

S VGDAMAKHLVFNPS HRSS S I LPRGLTWXAHSRHGQTRDRVHDR 

WVKNMYPP PS FPTNKAATVKSAGATPAB PfiBLLTPLAPALPS PA 

STATPPPTPTHPQALALPPLATEGAECLNVDDQDKGS PVTQEPE 

LASGGGGSGGVGKKBQI^VKJCLRVVLPALCCNTO 

RRIRRWLPJIFQASO^ENLBGKYLSFEAEEKIAEWVLTQREQQLP 

VNEETLPQKATKJGRSLEGGFKISYEWAVRFMLRHHLTPHARRA 

VAKTLPKDVAENAGLFIDFVQRQIHNQDLPLSM I VAIDB I SLFI* 

DTBVLSSDDRXENAliQTVGTGEPWCDWI^ 

RGQMIXJPANMPDSILIjEAKESGYSDDEIMFXWSTRVWQKHTACQ 

RSICMLWIIX^THLSEEVLAMI^ASSTLPAVW 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A~ Alanine, C=Cysteine f D=Aspartic Acid, E= 
Glutamic Acid. F«Phenyl alanine, G^Glycine, 
H-Histidine, I»lsoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S^Serine, T=Threonine, V= Valine, 
W-Tryptophan , Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








DVCIKKTVKNPLHKKWKEQAREM^^ 

LGVIGDCPELVQRSFLVASVLPGPIX5NIHSPTRNADMQEELIAS 
LEEQLKLSGKHSESSTPRPRSSPEETIEPKSLHQLPEGESETES 
FYGFEEADLDLMBI 


6134 


2 


4256" 


FVHGS MADTDLPMECKKEELBPWQKI SDVI ED S WEDYNS VDKT 
TTVSVSQQPVSAPVPIAAltASVAGHIiSTSTTVSSSGAJjJNSDSTK 
KTLVTI»IANNNAGNPLVQQGGQPL I IjTQNPAPGI^TMVTQPVLR 
PVQVMQNAWlVTSSPVASQPIFirrQGFPVimVRPVQNAM^ 
IVLNVQQGKnVRPITLVPAPGTQFVKJ?T^ 

S1MPVRPTTMTFTTVI PATLTIRSTVPQSQSQQTKSTPSTSTTP 
TATQPTSLGQIiAVQSPGQSNQTTNPKLAPSFPS PPAVS IASPVT 
VKRPGVTGENSNEVAKLVNTLNTI PS LGQSPG P VWSNN S SAH\ 
GSOJiTSGPESSMKVTSS IPVFDLQDGGRKICPRCNAQPRVTEAL 
RGHMCYCCPRMVEyQKKGKSLDSBPSVPSAAKPPSPKKTAPVAS 
/THPS3TP IPAI^PPY/TKVPEPNENVGDAVQTKLIMLVDDFYY 
GRDGGKVAQLTOTPKVATSFRCPHCTTOUiKNNIRFMNra 
LDCXiNGEVDGHTI CQHCYRQFSTPFQIiQCIHIfiNVHSPYESTTKC 
K^CEWAFESEPLFI^HMKDTHKPGEMPYVCQ^^ 
VHPRHIHEDTRHXJiCPYCIJCVFlCNGig^^ 
CNKCRVQPXFAKDIGCEHKLQHHKTPRKPKQLEGbKPGTKW 
SRGQPRT\a^SSNDTPPSAI/)EAAPIiT3SMDPLPVFIiYPPVQRS 
I QKRAVRKM3 VMGRQTCLBCSFB XPD FPNHFPTYVHCSIiCRYST 
CCS RAYANHM INNHVPRKS PKYLAIiF KNSVSG I KLACTS CTFVT 
SVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHGO/TRDRVHDR 
NVKNMYP PPS FPTNKAATVKSAGATPAK PEELIiTPLAPALPSPA 
STATPPPTPTHPGAIiALPPLATEGAECLNVDDQDEGSPVTQEPE 
LASGGGGSGGA7GKKEQI£VKIO^VVI.F^^ 
RRIRRWI^FQASQGENLEGKYI^FBAEKKLAEWVLTQRBQQLP 
VNEBTLFQKATK I GRSLEGG PKTS YEWAVR FMLRHHLTPHARRA 
VAHTLPKDVAENAGLFIDFVQRQIHNQDLPLSNIVAIDEI SLFL 
DTEVLSSDDRKENALQTVGTGEPWCDWLA1 
RGQMDQPANMPDS IIxCiRAKESGYSDDE IMKLWSTRVWQKHTACQ 
P^KG^VMDaiRTIII^EFAn^AMLSASS-IXPAVVPAGCSSKIOPL 
DVC I KRTVKNFLHKKW KEQAREMADTACH3SDVIJjQLVLWLGEV 
LGVIGJX!PBLVQRS FLVASVLPGPDGN I NS PTRNADMQEELIAS 
LEEQLKLSGEHSESSTPRPRSSPEETIBPESLHOLFEGBSETBS 
FYGFBEADLDLMEI 


6135 


2 


4256 


FVHGSMADTDLFMKCEEEELEPWQKISDVIBDSVVEDYNSVDKT 
TTVS VSQQPVSAPVP I AAHAS VAGHLS TSTTV S S SGAQN5DSTK 
KTLVTLIANNNAGWPLVQGGGQPLILTONPAP^ 
PVQVMQNANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNOVG 
IVIJ^QGQTVRPITLVPAPGTQPVKPTVGVPOWSQOT 
STMPVRPTTNTFTTVI PATLTIRSTVPQS0SQQTK3TPSTSTTP 
TATQP^SLGQIiAVQSPGQSNQTTNPKLAPS FPSPPAVS IASPVT 
VKRPQVTGENSNKVAKXiVnlJaW 1 H?£>lA*yi>FGr v V V^filNbcuatl \ 
GSQRTSGPESSMKVTSSIPVFDLQDGG3UCICPRCNAQFRVTEAL 
RGHM CYCCPEMVE YQKWGKS LDS E PS VPSAAKP PS PEKTAPVAS 
/ THPSSTP I PAI*SPP Y/ TKVPEPNBNVGDAVQTKLIMLVDDFYY 
GRDGGKVAQLTNFPKVATS FRCPHCTKRLKNN I RPMNHMKHHVE 
LDQQNGBVDGHTICQHCYRjQFSTPFQI^CraiEl^SPYESTTKC 
KICEHAFESEPIiFLQHMKimiKPGEMPYVCQVCQYRSSLYSBVD 
VHFRM IHEDTRR1^CPYCX>KVF1WG1^F0^HYMRHQKR\NVYH\ 
CNKC^VQFLFAXDKIEHXI^QHHOT 

SRGQPRTVPVSSNDTPPSAI^BAAPLTSSMDPLFUFLYPPVQRS 
IQKRAVRKMSVMGRQTCLKCSFEI PDFPNHFPTYVHCSIiCRYST 
CCSRAYAJNHMINNHVPRKSPKYIiAI»FKNSVSGI KIACTS CTFVT 
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ID 
NO: 
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nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, Cysteine, D-Aspartic Acid, 
Glutamic Acid, F=«Phenylalanine, G=Glycine, 
H-Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N~Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine , 
S=Serine, T=Threonine , V- Valine, 
W^Tryptophan , Y=Tyrosine, X -Unknown, *=Stop 
Codan, /=possible nucleotide deletion, 
\=>possible nucleotide insertion) 








SVGDAMAKHLVPNPSHRSSSII.PRGLTHIAHSRHGQTRDRVHDR 

NvKNMYPPPSFPTNKAAXvKSAGA^ pa 

STATPPPTPTHPQALAIJ>PIiATEGABCIJIVDDaDEGSPVTQBPB 

LASGGGGSGGVGKKEQl^ VKKLRVVLFALCCNTEQAAEH PRNPQ 

R RIRRWLERFQAS QG ENLEG KYLS FEABRKLAEWVLTQR BQQJjP 

VNESTLFQKATKIGRSl^EGGFKISYEWAVRFMLRHirLTPIIARRA 

VAHTLPKD VAENAGLF I DFVQRQ I HNQDLPLSMIVAl DE ISLFL 

DTEVI^SDDRKENALOTVGTGEPWCDVVIAI IiADGTVLPTLVFY 

RGQMDQPANMPDSILLEAKESGYSDDEIMELWSTRVWQKHTACQ 

RSKGMLVMDOIRTHLSEEVI^LSASSTLPAVVP 

DVCI KRTVKNFLHKKW KEQAREMADTACDSDVLLQLVL VWLGEV 

LGV1GIX:PELVQR^FLVASVLPGPDGNINSPTRNA1)MQEELIAS 

LKEQLKt.SGKHSESSTPRPRSSPRETIBPESLHQLFEGE3ETBS 

FYGFRBADLDLHEI 


6136 


1704 


539 


FGVRMALBGNSKRKRKRSVQEGEMPDDGVRGSPPEDYKLGQVAS 
SLFRGBHHS RGGTGR LAS L FSSLEPQIQ PVYVPVPK\ ESALASA 
DLEKEIHQK(^KIUa^SQPGVKVADRiai^m-KiriWSQR:<KlQ 
INQEEERLKNER1VFVGNLPVTCN KKKLKS FFKEYGQ I ES VRFR 
SLIPAEGTI^KKLAAIKBKIHPIXJKNI^YWFKBKSAATQALK 
RNGAQI AIXJE^I RVDIASSTSSRDKRS VFVGNLP YKVE ESAIEK 
HFLDCGSIMAVRIVRDKMTGIGKGFGYVLPKNTM 
SKI^xRKI^VmSVNKEKFKMNSNPRIiKNVSKPKQQIJJFTSKT 
AEGHPKSLFIGEKAVMiKTKKKGQKKSGRPKKQRKQK 


6137 


141 


2656 


RAIJIKRRCGPGRRGAIiGSGPGPQRRPGRVPEERPAPPKKRKHPG 
MHNNLIVAMClMLLGLPGIWQ^OTH^ 

YLWQG \LFQI»YIiDEAGRGHS FSFHGAALTAPKQGQELMAKALES 

LSCPKDMAPSHCAEHKDQFLQLSQYRQL^ 

LQHAGLREAGGIFYFSVPPFAYEDIARNINSSCRPGPGAWLRVV 

LEKPFGHDHFSAO^IATEU3TFPQEBKMYRVDirfIiGKQAVAQIL 

PFRDQNRKAIjDGLWNRHHVE RVEI I MKETVDAEGRTSFYE EYGV 

I RD VLQNHIiTK V JLi TbVAM BItPHNVS S AEAVLR 1 1 KLQ V tT) ALRG L 

ORGSAVVGQYQSYSEQVRRELQKPDSFHSLTPTFAGVLVHIDNI, 

RWEGVPFILKSGKAIJ>KRVGYARILF1CNQACCVQSEKHWAAAQS 

QaiPRQLVFHIGlIGDLGSPAVLVSRNLFRPSLPSSWKEMEGPPG 

IiRLFGS PI>SDYYAYSPVRERDAHSVLLSHI PHGRKKFF1TTENL 

IASWNFWPPM,ESIJulKAPm*YPGGABWGRI^ 

QCX5PBQLVPGPGPGPMPfiDF^VLRAKYRESSLV3AWSEELlSKI, 

ANDIBATAVRAVPJ^FGQFIIIiALSGGSS PVALFQQIA FPW 

AHTHLWLVDF^CVPLSDPKSWFQGIiQAIILLQHVRI PYYN1H\AM 

PVHI4WR1»CAEEDQGAHI YARE ISALGANSS FDLVIjLGMGADGH 

TASLFPQSPTGLIXJEQLVVIjTTSPSaPHRRHSLSLPI.niRAKKV 

AVLVMGRMKREITTLVSRVGHEPKKWPISGVLPHSGQLVWYMDY 

DAFDG 


6138 


4587 


934 


EFS KXTDRWQNAVO^VRQRKGDVDGLVRQHQDFTTS VBNLFRFL 
TDT^HLI>SAVKGQFJIFSLYQTR3LIHELKWKEIHFQRRRTTCAL 
TLEAGEKLLLTTDLKTKF^ VGRR I SQI^ I KQ 
FQSTVBTWDQCEKKI KELKSRIiQVLKAOSEDPLPBLHEDLHNEK: 
ELIKBLBQSI^WTONI»KEIXmiKADLTRHV]UVEDVMVL 
HLHRQWEDLCLR VA I R KQE IEDRLNTWVVFNEXNKELCAWIjVQM 
KNKVLG/T AD I S I EEM I EKJ^KDCMEBINLFSBNKI^LKQMGDQL 
I KASNKSRAAEI DDKLNKINDRWQHLFDVIGSRVKKLKETPAFI 
QQXJDK^SNl^TWLARrESELSKPVVYDVCDDQEIQKRlAFJDQD 
LQRDIEQH3AGVESVFNI CDVLIiHDSDACANETECDS IQQTTRS 
IJDRRWRNICAMSMERRMKIEETWRLWQK^ 
RTAACPNSSEVLYTSAKEELKRFEAFQRQIHERLTOJUBLINKQY 
RRLARENRTDTASRL KGWVHEGNQRWDNLQRRVTAVLRRI>RHFT 
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ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G -Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, fNMetbionine, N=Aspaxagine, 
P=Proline / Q^Glutamine , R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^TJnknovra, *»Stop 
Codon, /^possible nucleotide deletion, 
\«poesible nucleotide insertion) 








NQRBBFBGTRBSIIiVWLTBMDIXJLTNVKHFSESDADDKMRQLNG 
FQQEITIi»rrNKIDQLrVFGEQLIQKSEP\LDAVLIEDEIjEEIJIR 
YCQEVFGRVSRFHRRLTSCTPGLBDEKKASENETDMEPPRB ICJT 
DSWRKRGBSKEPSSPQSLCHIiVAPGHERSGCBTPVSVDS\IPLB 

wdhtgrrggpsssh\kbdbeaqyy\sai»sgrsisdghswhvpds 
pscpbhhyk^gdrnvppvppasstpykppyaklllppgtdgg 
kegprvi^gnpqqedggi^itboo^gafdrwbmig^bl\hnk 

LKIKQNLQQLNSDISAITTWIiKKTEABLBWIjKMAKPPSDIQBIB 

LRVKRIiQBILKAFDTYKALWSVNVSSKKn»QTESPBSTE 

IMI^ UjWBAAQGAVDS WRGGLRQS tiMQCQUtllQI^ QNJUjIiWLA 

SAKNRROKAHVTDPKADPRALLKCRRELMQLEKELVERQPQVDM 

LQETSNSLLIKGHGEDCIEABEKVHVI\EKKLKQLREQVSQDLM 

AIX2GTQNPASPLPSFDEVDSGDQPPATSVPAPRAKQPRAVRTTE 

GBBBTBSP^PGSTRPQP^FLSRVVRAALPLQIiTiTiTiTiT.TJiTiAaUL 

PSSEEDYSCTQANNP\ARSFYPMLRYTNGPPPT 


6139 


52 


1131 


LGDWVWSRTCGVLETPTSVLRRARARGPCPTDSKWALPRIjRKGK 
TBRRPWKAS SWKTL /IJ\GWIGGAASVTVGHPIJ>1VKTRLQAGVG 
YGNTLS C I R WYRRBSM FGFFKGMS FPLAS I AVYNSVVFGVESN 
TQRFkSQHRCGRPEAS PPRTLSDLLLASMVAGWSVGliGGPVDL 
IKIRLQMO/I^PVSGRQPRFT8VQGSGSCG\BPAYQGPVHCITTIV 
PJJEGIJ^LYRGASAMIJJlDVPGYCLYPIPYVFIiSEWITPBAX^ 
TS PCAVWLAGGMAGAI S WGTATPMDWKSPJjQADGVYIjNKYKGV 
IjDCISQSYQKEGLKVFPRGITVI^VRGFPMSAAMFLGYEIjSLQA 
IRGDHAVTSP 


6140 


i 594 


136 


RPEIJSLWRi^RSWRPLGVPRRCHRRNWKBPVRAQPl^VTVWAP 
RCQRP/QPPAPBPSSPWAAVPEAIPTPRAAASAALELPLGPAPV 
SVAPOAEARARSTPGPAGSRLGPETFRQRFRQFRYQDAAGPRBA 
FRQI»RKL/SPRQWLRPD I \PJTKBQ\ I VBMLVQEQIjIiAILPEAAR 
ARRIRRRTDVRITG 


6141 


2 


984 


AQVGPRSRPCKMPLKLRGKKKAKSKETAGLVEGEPTGAGGGSbS 

ASRAPARiU»VFHAQriAHGSAlGRVEGFSSIQELYAQIAGAFEXS 

PSEIIiYCTLI^KIDMERLIiC^IjGLEDFlPAHVKGIEKEVl^ 

KSBDSLGLTITraiGVGYAFI KRIKDGGVIDSVKTICVGDH I ESI 

NGENIVGWPJIYDVAKJCLKEIJCKEBLFTMKLIBPKKAFEIELRSK 

AGKSSGEKIGCX5RATIiRI»RSKGPATVEEMPSETKAK\AIEKIDD 

VI^LYMGIPJ^IDIATTMFEAGKDKVN^ 

DEFVFDVWGVTGDAKRRGL 


6142 


116 


602 


BABGEQVCGAKCCGDAPHVENRKEETAR igpgvmes keeralnn 
LI VENVNQEITOEKDEKEQVANKGE PIALPLNVSEYCVPR 

fr vrqp i rjqyrwdi mhrlgepoarmree23merigeevrqlmb ki. 
rekqlshslravstdpphhdhhdefc\lmp 


6143 


2802 


276 


-ppjraiFTSCPHNi^^ 
\QLPVFKHRDS I VETLKRHRVVVVAGET\GSGKSTQVPHFKLED 
LIiLNEWEASKCNI VCTQPRRI SAVSLANRVCDBLGCENGPGGRN 
er yvwriTi7R«7a"P a r^ircTPT.T .Yr'T'PnvT .1 .H K LORDGLLSNVS /HM 
FIVDEV\HKR\SVQSDFLLI ILKEILQKRSDLHLILMSATVDSE 

kfstyfthcpilrisgrsypvevfhledi ieetgfvlekdsbyc 
qkfleeeeevtinvtskaggikkyqkyipvqtgahadlnpfyqk 
yssrtqhailymnpiikinldlilei*i*ayldkspqfrni egavli 
bt j pgi^liqqlydi>lsndrrfyserykyialhsilstqdqaaaf 

TLPPPGVRKIVIiATNIAETG I T I PDWFVI DTGRTKENKYHES S 

QMSSLVETFVSKASAIiQRQGRAGR\n2IXiFCFRMYTRERFEGFMD 

YSVPEIIiRVPLBBI*CLaiMKCNLGSPBDFIiSKALDPPQIiQVISN 

AMNLLRKIG^UJEPKIjTPW 

GCLDPVATLAAVMTBKSPFTTPtGRKD^ 

TI YMAYLGWKKARQEGGYRS e itycrrnflnrtsixtledvkqe 
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NO: 


Predicted 
beginning 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine # O Cysteine, D~Aspartic Acid, E» 
Glutamic Acid, P= Phenylalanine , G=Glycine, 
H^Histidine, I=Isoleucine, fc=Lysine, 
L=Leucine, M==Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R^Arginine, 
S=Serine, T=Threonine, V=»Valine, 
W-Tryptophan, Y-Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, j 
\~possible nucleotide insertion) 








LIKI1VKAAGFSSSTTSTSWBGIIRASQTI1S FQEIALLKAVLVAGL 
YDNVGKI IYTKSVDVTBKIiACIVETAQGKAQVHPSSVWRDLQTH 
GWLLYQE KI RYARVYLRETTL ITPFPVIiLPGGDIEVQHRERLLS 
IDGW I YFQAPVKI AVI FRQLRVLI DS VLRKKLENP KMSLBNDKI 
LQI ITELI KTBNN 


6144 


1289 
> 


568 


SGPGSMSGQRVDVKVVMU3KEYVGKTSLVBRYVHORFLVGPYQN 
VSASGGARHGGRGSGGPVT(nTGPDW?PLVA\riGAAFVAKVMS 
VGDRTVTLGIWDTAGSKRYEAMSRIYYRGAKAAIVCYDLTDSSS 
FERAKFWVKEWISI*EEGCQIYI>CX5TKS 

QDYAnNIKAQLFKTSSKTGQSVDELFQKVABDWSVAAFQ\^B 
DKGVDLGQKPNPYFYSCCHH 


6145 


1109 


196 


GGMDLSBLERDNTGRCRLSSPVPAVCRKEPC^IiGVDEAGRGPVIi 
GPMVYAICYCPLPRLADIiEALKVADSKT^ 

DFVGWALDVLS pmilSTSMLGRVKYNLNSLSHDTATGLIQYALD 
QGVNVTQVFVTJTVGWPKTYQARIiQQ^ 
\VSAASICAKVARIWVKKWQPVE1MDI^^ 
/TKAVnJffiHVEPVF\GFP\QFVRF\SWRTAQTI \LEKEAEDVIR 

kdsasenqeglrkitsyflnegsqarprsshryflergi.es TTS 

L 


6146 " 


428 


781 


LKKKGKEKAEAQQVEALPGPSIilXJWHRSAGEEEDGPVLTDEQKS 
R/YPGHEAHDQGG\WDARQSlIRKVVDPETGRTRIil kgdgbvle 
BIVTKERHRE INKQATRGDCLAFQMRAGLLP 


6147 


1 


2304 


GTRQLPPPSPGSdPGDSPEGPEGEAPERRRKAHGMLKLYYGLSE 

GEAAGRPAGPDPLDPTDI*NGAHFDPBVYLDKLRRECPLAQLMD3 

ETDMVRQIPJUJ)SDM0^VYENYNKFISATDT1RKMK^ 

DKMDRLATN14AVITDFSARISATI£DRHER0^^ 

QFLFELPSRI.TKCVEI^YG0AVRYQGRAC3AVLQQYOHLPSFRA 

IQDDCQVITARIiACK^QRFREGGSGAPEQAECVELLLAIjGEPA 

EEIXXEFTAHARGRIjEKELRNLEAELGPSPPAPDVI J KF'rDHG\ S 

sgfvgglcqvaaayqklfaaogpagaeklrafarqlgsryfalv 
errlaqrcx^ggdnsi^vraldrfhpjilrap^ 

EIVERVAP^RXGHHLG^IJRAAFI£CLTO^ 

GLAELLANVAS S ILSH I KASLAAVHLFTAKEVSFSNKPYFRGEP 

CSQGVREGLIVGFWISMCXyPAQS FCDS PGEKGGATP PALLLLLS 

RIXrLDYETATISYILTLTDBQFT^VQDQFPVTPVSTLCAE^ 

RPJbLTHYVKVQGLVISQMLRKSVETRDWI^ 

VEDTTAT DVQVLPRLAGVALTQAGGTWSRGAGAAEDHWQS LPG 

GGDMCIKASHGASSVARASWEPC^KSPRMNTKRAGECLCPRS 

CSFSAQD YDIFAP I LP VEKQRLRVTQEVRAGLVLVLKI RPQTKTS 

CILPLPHSTGS rNSDHVPTK 


614B 


3656 


353 


VPAVGGTFADGAMGEABKFHYIYSCDLDINVQLKIGSLEGKREQ 

KSYKAVI>BDPMLKFSQLYQETCSDLiVTCQVF7^BGKPIiALPVRT 

SYKAFSTRWNWNEWLKLPVKYPDl^R 

PVGGTTVSLFGKYGIIFRQGMHDLICWPKC^ 

S STLSEDQMSRLAKLTKAHR0X5HMVKVDWLDRLTFRE I EM INKS 

VKRSSN FMYLMGG FRCVKCDDKEYGIVYYEKDGDBS S P I LTS FE 

LVKVPDPO^SLFmVF^KRHNLPRSLRSGPSDHDLKPYPSPRDQ 

LKNIVS YPPSKP PTYEEQDLVWE FRYYLTNQDKALTKI LTSVI W 

DLpQGAKQALALIX3XWKPMDVEDSLELI^ 

LRQADDEDLLMYLLQLVQALK YENFDDI KNGLEPTKKDSQSS VS 

ENVSNSGINSAEIDSSQIIT/SAPFPSVSSPPPVRSECTKEVPDG 

ENLBQDLCTFLI SRAS KNSTIJVNYLYWYVT VECEDQDTQQRDP K 

THEMYI*NVMRRFSQALLKGT>KSWVMR^ 

KAVQRESGNRKKKNEPJX5^LGDNEKMNLSDVELIPLPLEPQVK 
IRGII PBTATLFKSALMPAQLFFKTEDGGKYPVI FKHGDDLRQD 
QLI LQ I ISLMDKLLRiCENIiDLKLTPYKVIATSTKHGFWQFIQSV 
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SBQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=? Alanine, C^Cysteine, D=Aspartic Acid, Ess 
Glutamic Acid, F=:Phenylalanine, Gs=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, N-Asparagi ne , 
P=Proline, Q=*Glutamine r RwArginine, 
S= Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=« Unknown, *»Stop 
Codon, /possible nucleotide deletion, 
\=spossible nucleotide insertion) 








PVABVLDTEGSIQNFFRKYAPSENGPNGISAEVMDTYVKSCAGY 
CVITYILGVGDRHLDNLLLTKTGKLPHIDFGYIIiGRDPKPr.PPP 
MKLNKEMVTK5WGGTQSEQYQEF^ 

SLMVDANI PDIALBPDKTVKKVQDKFRLDLSDBBAVHYMQSI.ID 
BSVHALPAAWBQIHKFAQYWRK 


6149 


l 


1413 


RVDPRVRENGTANP I KNGKTS PAS KDQRTQKKTS VQGQ VQ KGND 
BSESDFBSDPPSPK3SEEBEQDDEEVLQGEQGDPNDDDTEPENL 
GURPLLMDS KDEEKEEKHSSDSDYEQAKAKYSDMSSVYRDRSGS 
GPTQDI^I JjIjTSAQ1>SSDVAVETPKQE FDVFGAVPFFAVRAQQ 
PQQBIOffiKNIiPQHRFPAAGLBQEEPDVPTKAPPSKKVNVQE^ 
VGPEAirriPGYPKSVDVFGSTPPQPFLTSTSKSESNEDLFGLVP 
FDE ITCSQO^KVKQRSLQKLS SRQRRTKQDM3 KSNGKRHHGTPT 
STKKTLIOPTYRTPERARJUIKKVGPJIDSQSSNEFLTI SDSKENI S 
VALTDGKDRGNVIjQPEESLLDPFGAKPFHSPD\LSWHPP\HQGIi 
S\DIRADHNT\VLPGR\ PRQNSLHGSFHSADVLKMDDFGAVP / F 
LTELWQSITPHOSQQSQPV\ELDPFGAAPFPSKQ 


6150 


372 


37 


MSNIKKYI IDYDWKAS I B I EIDHDVMTBEKLHQINNFWSDSE YR 
LNKHGS VLNAVLt WLAQHALL1AI SSDLNAYGVVCEPDWNDGNG 
QKGWPPMDGSEGIRITDIDTSG1P 


6151 


1555 


521 


DSNQQSVSGTAASTLLHS FKA'I I YYQGTGHVQQF YGVT5 PYSQT 
TPPI VQ3YAQPSLQYIQGQQI FTAHPQGVWQPAAAVTl'I VAPG 
QPQPLQPSEMVVTl^IJJIjPPPSPPIU^KTIVI/PPKVJKTARDPEG 
KIYYYBVITROT(^PPTWESPGDDASLEHEAEMDLGTPTYDEN 
PMK\ASKKPKTAEAITrSSEI*AKKSKEVFPJCEMSQFIVQCLNPYR 

kpix:kvg\ritttedpkhiarklthgvmnkeijcycknpe\dlec 

NENVKHKT KEY I KKYMQKPGAVYKPKBDTEFRVTVGPGWEDGWS 








GKTDSRBRKSCGPFCSTPVSTVmiMIHHPGEFNPADVN 


6152 


1366 


648 


NRTW3TPSTWMGVALPP1CSTGPWPVTKQITARTTCUAVPAKCP 
PWC/DVK^PRCQPPIX^IGHGrCVDGlICQCTGHFWRGPGCDELDC 
GPSNCSQHGLCTKTGCRCDAGWTGSNCSBBCPLGWHGPGCQRPC 
KCEHHCPCDPKTGNCSVSRVKQCI^PPEATIJlAGBLSFFTRrAW 
J^ALTrJU J At , LLLlSTAANLSLLLSRAEF^NWU_.HGDYAYHPIX}EM 
NGEPLAABKEQPGGAHNPPKD 


6153 


2 


3368 


GRVGARSPGRAYALLLLIil CFNVGSGIiHLQVLSTRNENKLLPKH 
PKLVRQKRAW I TAP VALLEGEDLSKKN P I AKXHS DIiABERGUCI 
TYKYTGKGITEPPFGIFVFNKIJTTjELNVTSIIiDR^ 
YAIiDARGNNVEK PLELR IKVIJ^INDNEPVF/rQDVFVGSVEELSA 
AHTLVMKINATDADE PNTLNS KI S YRI VSLEPAYP P VFYLNKDT 
GEIYTTSVTIJDPsEEHSSYTLTVEAPJOGNGBVTI)KPVKGAQVQ 
ILDVNDNI PVVENKVLPOiVEElQVNVEVTRIKVPDADBIGSDN 
WLANFTPASGNEGGYTOIETDAQTNEGIVTLIKBVDYBEMK^ 
FSVrVA>«CAAFHKSlRSKyKPTPrPIKVK\n<NVKEGIHFKSSVI 
S I YVSESMDRSSKGQ 1 1 GNFQAFDFJTTGLPAHAR YV1CLEDRDNW 
ISVDS VTSB IKLAKLPDFESRYVQNGTYTVKI VAI SEDYPRKTI 
TGTVLDTVElJIiroNCPTLIEPVOT 

SGPFSFSVIDKPPGMAEKMKIARQESTSVLLQQSEKKLGRSE IQ 
FLISDNOGFSCPEKQVLTLTVCF^^S \GCREAOHDSYVGLGP 
AAIAIJ^IIJVPIJ^IJiVpi^mCHCGKaAKGFTPIPGTIF^HP 
WNNEGAPPEDKWPS PLPVDQGGSLVGRNGVGGMAKEATMKGS S 
SASIVKGQHE>KEMIX5RWBFjmSIXSGRATQFTGATGAI \MTTE 
TTI TARATGAS RDVAGAQJ^AAVALNBEFLKN Y FTD KAAS YTEED 
ENHTAKIXIIiLVYSQE RTE SLNAS IGCCS FI EGELDDRFIxDDLGL 
KFKTItABVCIiGQKI D XNKK I EQRQKPATETSMNTASHS LCEQTM 
WSENTYSSGSS FP VPKSLQEANAEKVTOE IVTBRS VSSRQAQK 
VATPLPDPMASRNVIATETSYVTGSTMPPTTVIIiGPSQPQSLIV 
TERraPASTLVIXJPYANEGTVVVTERVIQPHGGGSWPLEGTQH 



457 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 

NO: 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment- containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, K= 
Glutamic Acid, F«Phenylalanine, G=OLycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L= Leucine, M -Methionine, N=Asparagine , 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Tnreonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X-Unknown, +=Stop 
Codon, /-possible nucleotide deletion, 
\opossible nucleotide insertion) 








I^DVPYVMVRER^SPI^SSGVQPTLAMPNIAVGQNVTVTSRVL 
APASrU^SYQIPTBNSMTARNTWSGAGVPGPLPDFGLESSGH 
SNSTI TTSSTRVTKHSTVQHSYS 


"6154 


3660 " 


214* 


KKKTKMKNTLGJKTVNFGAWPKPTI SDKSHLLQMVSKLDLTDAKN - 
SDTAHIKSIBITSILNGLQASSSSAEDSBQEDERGAQDMDNNaK 
EKSKIDHLTlWRNDLISKEBQNSSSrj^KEKKVHADLVISKPVSK 
SPBRIiRKDIBVLSBBTOxEEDEVTKKRTO 

RGKRRYCNTKECUCTGS PGKKKBKAKNKESLCMENSSNSSSDED 
KEETKAKMTPTKKYNGI*EEKRKSLRTTGFYSGFSKVABKRIKIiL 
NNSDERLQNSRAKDRKDWSSIQGQWPKKTIjKETiFSDSDTEAAA 
SPPHPAPEEGVAEESUrnrWSEESCSPSVELEKPPPVNVDSKPI 
EBKTVBVNDRKABPPSSGSNPSA* IPI,PYLHU9RLHQSL*QKGS 
RQQSSVTVSEPIAPNQBEVRSrKSKTDSTIEVDSVAGBLQDLQS 
ERE* LASRP* OQCELKQ * *SARTRTS * KSLYRSBKSERCSGRRK 
PIKKABKKP* SNSGKQQKEGK 


6155 


669 


121 


HLI*PEI^3KSWITMKYVFYIXJVi J AGrFFFADSSVQKEDPAPYLV 

YLKSHFNPCVGVLIKPSWVLAPAHCYTiPNIJCVWL^ 

TEQT2^IQlVRY>JNYSHSAPOJ3DI^LXiaJuXPAMrOTKVOAI»N 

P\P"I^NVRPGTVCLI^IJ>WSQENSGPJIPDIJiQNLEAPVMSDRE 

C3QKTBQGKSHRNSLCVKFVKVPSR3 FGBVAVATVI CKDKLQGIE 

VGHFMGGDVGIYTNVYKYVSWIENTAKDK 


6156 


5725 


3984 


GTSTVTHATKKHFSIILNIjIjGMLIiKKDNQOT 

VMKKSBTYAPLFCLPSPHKFCKGLLADTLVHDyNICLQACSSLH 

ALSS S LPDDIiLQRCVDV CRVQLVHRGTCI RQAFGICLLKS I PLGV 

PLSNNNHTEIQEISLALRSHMSKAPSNTPHPQDFSD/VISPILY 

GNSH11TGKI)KWLERLPYSCQRI^KRDQSTIPRN1J>KTDAVLWQW 

A IWEAAQ FTVLS KLRTPLGRAQDTFQT I EGI I RS LAGHTT,NP DQ | 

DVS QWTTADNDEGHQNNQIJnjVLLIjQ YIiENLEKLM YNAYEG CAN 

ALTSPPKVlRTFIjYTNRQTCQDWLTRIRIiS IMRVGLLAGQPAVT 

VRHGFDLLTEMKTTSLSQG1IELBVS IMMVVEALCBLIK^EAIQG 

IAVWSSS IVGKHLLWINSVAQQAEGRFEKASVEYQEHLCAMTGV 

IXXZISSFDKSVLTIiASAGCKSASLKHCI^GESRKSVLSKPTDSS 

PWINYIXJNKACECTISTADWAAVQEWQNAIHDLKKSTSSTSLN 

LKADFWYIKSLSSFESGKFVECTEQL^ 

XDMKKLLRNM 


6157 


946 


329 


MANRGPSYGLSREVQEKIBQKYDADLBNKLVDWI ilqcaedieh 
PPr j GRAHFQKWIJ4DGTVIXnC»INSI»YPP^ 
OMEQISQFUCAAETYGVRTTDI FQTVDLWEGiaDMAAVQRTLMAI, 
GSVAVTIOJIXSCYRGEPSWFHPJCAQQNRRGFSEEQLRQG^ 
QMGSNIDGJISQAGMTGYGMPRQIM*DAASCP 


6158 


441 


1482 


LGSLIVISLHCKVIFS^SLERAMKEKAVDLW^ 
PILEGKDHNQNTGVDPI IDHVQDRKTD/SRSKS PHKKRSKSRBR 
RKSRSRSHSRDKRXDTREKIKEfCSRVKEKDREKBREREKEREKE 
KRl^KHKDRfiKFBR^n^RFKPK^KPRTfflBRP^ i ^HEKDRPK BKB 
QDKEKERBKDRS KE I DEKRKKDKKSRTPPRSYNASRRSRSSSRE 
RRRRRSRSSSRSPRTSICrlKRKSSRSPSPRSRNKKUKKREKERD 
HISERRERERSTSMRKSSNDRIXSKEiajEXNSTS^KEKEHNKEPD 
SSVSKEVDDKEAPRTEKNKJiQK^NCQLNEENLSTKTEAV 


6159 


53 


84 


AVIAPLH I S LGDRARPYLKNTBKSSTTCSRRRNQSFPPVMSLTII 
RLKOXKYWGCAVSNVCRFWEGRPI^rjMIWPYTLPVSl^VGSCV 
I ITGTPII»TEVKDPQLEVNFTrGMDEDSDIAFQFRl»HFGHPAIM 
NSCV EG I KR YEEKCYYLPFEDGKPFE1.C I YVRHKE YKVMVNGQR 
!YNFAHRFPPASVimQVFRDl6LTRVLISD*GRCVRITAVQEF 
DVS VS CDCTTAYQPG 


6160 


1626 


1790 


AGAKFFP* P*K3^ADAQPTBSEKEIYNQVNVVLKDAEGILEDLQS 
YRGAGEE I REAIQHPADEKLQEKAWGAVVPLVGKLKKFYEFSQR 



458 



WO 01/53312 




SEQ 
ID 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^ Alanine, C=*Cysteine, D^Aspartic Acid, E=* 
Glutamic Acid, F- Phenyl alanine , G=Gly cine , 
H=Histidine, I-Isoleucine, K=I»ysine, 
L-Leucine, M=Methionine, N=*Asparagine. 
P=Proline, Q=Glutamine, R-Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyro3ine, X=Utoknown, +«Stop 
Codon, /^possible nucleotide deletion, 
^possible nucleotide insertion) 








IiRAALRGIjLGALTSTPYSPTQHLEREOAIiAKQFAEIIiHFTIiRFD 
ELKMTNPA IQNDFS Y YRKTIjSRMR INNVPABGKNBVNNE LANRM 
SLFYAKATPMLKTLSDATTKFVSKNKNLPIENTITKZLSTMA^ 
RVMLBTP BYRS RFTNEE TVS FCLRVMVGVI ILYDEVHP VGAFAK 
TSKI DMRGCI KVLKDQP PNSVEGLLNBVLRYTTKHLinJETTSKQl 
KSMLQ*QIiItTLVNKG 


6X61 




1569 


PVSGSBSSLRRAWAS I LRLMJjG PRVAVS I LCEDGISH* IiLEKH* 

KSHVL£PLSSLALEEQCIiAXiSIiDWSTGKTGRAGIX3PI>KI ISSDS 

TGQK^L>VVNETRPRI 1 QKVASWQAHQFEAWIAAFNYWHPEIVYS 

GGDDGIXRGWirreVPGKFLFTSKRHTMGVCS I QSS PHREHTLAT 

GSYDSBTIJiMDTONMKQPIiADTPVQGGVWRIKH 

CMHSGFKlLKCQKAMEERQEATVLTSHTLPDSLVYGADWSWriLF 

RSI^RAPSWSFPSNIXrTKTADLKGASELPTPCHECRBDNDGEGH 

ARPQSGMKPLTEGMRKNGTm^ATAATTOIX!GVNPEEADSAFSL 

LATCSFYDHALHIiWEWEGN 


6162 


1 


586 


RTIHATGRAGAS PMHRlj IVWRLAEANWQHVRCQKCLEFGHWTYE 
CIGKRKYXHRP SH.TAELKKAIJKE KENKWj1»QOS IGETNVERKAK 
KKRSKSVTSSSSSSSDSSASDSSSESEETSTSSSSEDSDTDESS 
SSSSSSASSTTSSSSSDSDSDSSSSSKQ*HQHR*QL*R*TTKEE 
BKEIELLHSYWTDGLKTLM 


6163 


1081 


785 


R IRS TTBGCAVRLHPTQNTG KAR1M I LIjSVS bGRHWAFTYKPFL 

TPVVFVFFFFFFHRKfi*VMQKNPMKSRBDEHMBKLNOTJ^ 

MNRLIMNYLVTEGFKEAAF^FRMBSGIEPSVDLETLDBRIKIRE 

MIIiKGQIQEAIALINSLHPELLDTNRYIiYFHLQQQHI.IELIRQR 

ETEAAIiEFAO/TQLAE^EESRECLTEMERTLAIJAFDSPEESPF 

GDLLHTMQRQKVNSKWQA.VIJ3YENRESTPKIJUCLLKLLLWAQN 

ELDQKKVKYPKMTDLSKGVIEEPK 


6164 


90 

• 


406 


PCQS PGRS RMRQD KI/TGS LRRGGRCLKRQGGG VGT I IiSNVLKKR 
SCISRTAPRI^CTl^PGVmnCLKFTIiEPSD^ 
VARI^TGIPKEWRKVWLTIiADHYl^SIAIDWDKTMRFTFWERS 
NPDDDSMGIQIVia)LflRTCCSSYCGQEAEQDRVVl*KRVIjIiAYAR 
WNKTVG YCXJGFNIIiAAL I LEVMKGNEGDALKIM I YL I DKVLP ES 
YFVNNLRALSVDMAV FPJDLLRMKLPEliSQRIiDTLO^TANKESGG 
G YEPPLTNVFTMQW FLTliFATCliPNO/rVI^IlTOSVPFEGSE I IL 
RVSLAI WAKLGEQ I E CX!F^ADE FYSTMGRLTOJBMIiENDLIjQSHE 
LMQTVYSMAPFPFPQIjAELREKYTYNI tpfpatvkpts vsgrhs 
KARDSDBBNDPDDEnAVVNAVGCLGPFSGFLAPBl^KYQKQIKE 
PNEF^SIiR3NNIAEI.SPGAINSCPvSEYHAAFN5M!WERMTTDIN 
ALKRQYSRIIOCKQO^VHOVYIRADKGPVTSILPSQVNSSPVIN 
HLIJjGKKMKMTORAAiOlAVIH I PGHTGGKI S PVP YRDLKTKLNS 
PtfRTHlRVHKKNMPRTKSHPGCGDTVGLIDEQNEAS^ 
EAFPSGCTATAGRF^SSPEGSTRRTIKGQSPEPVFGDADVDVSA 
VQAIG^GALEL^RIJAAAETELRVHPP CQRHCPEP P S APE ENKAT 

HFPQMSRSFS KPGGGNSGP * KMVFS SGTMLS RQLPG Y PQEYQRN 
GGERFG 


6165 


90 


406 


PCQSPGRSRMHQDKbTGSLRRGGRCLKRQGGGVGTlIiSNVIiKKR 
SCISRT^RLI^rniBPGVOTKMFTfcEP 

VARLSTG I PKE WRRKVWLTLAPHYXJIS IAI DWDKTWRFTFNERS 
NPDDDSMG I Q I VKDLHRTGCS S YCX^HAEQDRWLKRVLLAYAR 
WKKTVG YCQGPNIIAAIiI LEVMEGNEGDALK I M I YL IDFCVLPES 
YPVNNLRALSVDMAVFRDLLRM^ 

GYEPPIjTNVFTMQWFXTLFATCLPNQTVIJCIWDSVFPEGSEI il 
RVSLAIKAKLGEQIECCETADEFYSTMGRLT^EMLFJTOLL^ 
IJ^O/I^SMAPFPFPQLfliLREKYTYNITPFTATVKPTSVSGRIIS 
KARI)SDEBNDPDDEnAVVNAVGCLGPFSGFLAPELQKYQKQIKB 
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ID 
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Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Aianine, C=Cysteine, D=Aspartic Acid, E=» ' 
Glutamic Acid, F=» Phenylalanine, G -Glycine , 
H-Histidine, i=lsoleucine, K=Lysine, 
L= Leucine, M=Me thionine , N^Asparagine, 
P=Proline, Q^Glut amine, R^Arginine, 
ScSerine, T=Threonine, V=Valine, 
W^Tryptophan, Y»Tyroeine, X=Unknown r *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PNBKQSIiRSNNIAELSPGAINSCRSKYHAA^SMMMBriMTTDIN 
ALKRQYSR1KKKQQQQVHQVYIRADKGPVTSI LPSQVNSSPVIN 
HIXLGKKMKMTNRAAKNAVIHIPGHT^ 

PWRTHIRVHKK3IMPRTKSHPGa5DTVGLIDE^JEASKTNGI«GAA 
KAPPSGCTATAGREGSS PEGSTRRTIEGOSPEPVPGDADVDVSA 
VQAKLGALELNQRDAAAETELRVin>PCX)RHCPE P PSAPBKNKAT 
SIO^OGSNSKTPIFSPFPSVKPIJIKSATARNI^LYGPTBRTPTV 
HPPQMSRSFSKPGGGNSGP * KMVFSSGTMLSRQLPGYPQEYQRN 
GGKRFG 


6166 


2 


1206 


HKLWRTVAMAGABWKSLBEOiEKHLPLPDLQEVKRVl»YGKBLRK 

I^LPREAFEAASREDF^IWYAFKAAEEQIJIRPRIVHVGLVQNR 

IPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNI ICFQBAWTMP 

FAFCTREKLFm , BFABSAEIX3PTTRF(X?KLAKNHI>MVW PILE 

RDSEHQDVLWm^WVISNSGAVLGKTRl^ 

EGNLGIIPVFQTQFGRlAVNICTGPJffiPI^LMYS INGAB 1 1 FN P 

SATIGJVLSBSLWPIKARNAAIANHCFTCAINRVGTEHFPNBFTS 

GIX3KKAHQDFX3YFYGSSYVAAPBSSRTPGLSRSRDGLLVAKLDI* 

NU^VNDVWNFlvmGRYB 

VPALG 


6167 


1220 


1B44 


YGIVTGPSU^DKQPKKQEKNPVLVSPEFVDEAI^ACBEYLSN 
LAHMD IDKDLEAPLYLTPEGlf SLFLQRYYOW 
VQRCEDILQQLQAVVPQ1DMEGDRNXWIVKPGAKSRGRGIMCMD 
HLEEMLKLVNGNPVVMIvI)GKWWQ 

IiVTDWNPLTVWFYRDSYIRFSTQPFSLKNLDK*APLYLTPEGWS 
LFL£RYYQVVHEGAELRHIJ)TQVQRCE^ 

DRNIW I VKPGAKSRGRGIMCMDHLBEMLKLVNGNPVVMKDGKWV 
VQKYIKRPLLIFGTKFDliRQWFLV*rDWNPLTVWFYRDSYIRFST 
QPFSLKNLDK 


6168 


84 


1392 


VWPVPSVSAMPPKKQAQAGGSKKABQKKKEKI IEDKTFGLKNKK 
GAKCXJKFIKAVTHQVKFGQQNPRQVAaSEAEKKLKKDDKKKEI.Q 
ELNKLFKPVVAAQKISKGADPKSWCAFFIOQGQCTKGDKCKFSH 
DLTLERKCEKR^VYIDARDEELBKDI'MDNWDE KKLEEWNKKHG 
EABKKKPKTQI VCKHFLBAI ENNKYGWFWVCPGGGD I CMYRHAL 
PPGPVLKKKKKKKJK3CBDBISL # DLIERBRS^^ 
I^KKRKKQEKIDKI^DMERRJCADFKAGKALVISGRBVFBFRP 
ELVNDDDEEADDTRYTQGTGGDEVDDSVSVND 1DLS LYT PRDVD 
ETG I TVASLBRFSTYTSDKDENKI^EASGGRAENGBRSDLBEDN 
E^EGTENGAjrmVPVDBI^FTGEDLDKJaF^ELNTIjDLEE 


6169 


112 


662 


APAAAMAERPEDLNLPNAVITRIIKEAI*PIXJVNISKEARSAISR 
AASVFVLYAISCANNFAMKGKRKT^^ 
PUffiAIiEAYRRBQKGKKEASBQKKKDKDKK^ 
DBDEBRIiEEEBQNBEEEVDN* RGRETVTvPWKVPLEMRRATCFCE 
AFPCWAE 


6170 


62 


667 


STKVMLPNTGRIAGCTI^F ITGASKGIGKA1 AIiK/^IX^ I V IA 
AK. r/*UlrHFKJ_ilAaTI YTAAKK I KJW*jGKA1jP\- 1 VJU VKOEQQISAA 
VEKAI XKFGGIDI LVNNASAI SLTNTLDTPTKKLDLMMNVNTRG 
TYLASKACI PYIJCKSKWAHIPmSPPL»LNPVWFKQHCGRW*VV 
G*GDGLCLICFBLNLCMSDVITICT 


6171 


3B2 


941 


HFMOSDVEI^DIEPaSHTKFPPTLPI^TTVIVCSCHPVATAST 
MAEAFSKTTSEEDQSXQEPKKANSMTAQKQKK*GLRGSRRRHAN 
SGGDIFGDSFAAYFPRVLKQVHQAI^LSQEAVSVMDSMVRDILD 
R IATEAGI ILAHYS KCVTI TSRD IRMAVCLLLPGKMG JCLAESQGT 
NATLRYTKSK 


6i72 


651 


54 


GLCRAGGAHRFSRTHVBAAIJCMIjRRKARIiKRBYLYRKAREeAQR 
SAQRRKBRLRRALEENRLI PTBI^RBALALQGSLEFDDAGGEGV 
TSHVDDEYRWAGVEDPKVMITTSRDPSSRWMFAWSLXLVP^ 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence ' 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C-Cysteine, D=Aopartic Acid, K= 
Glutamic Acid. F»Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Aiginine, 
S=Serine # T=Threonine, V»Valine f 
W-Tryptophan, Y^Tyrosine, X-Unknown, *-Stop 
Codon, /-possible nucleotide deletion, 
\~possible nucleotide insertion) 








QRMNRGRHE VGALVRACKANG VTDLLVVHEHRGTP VGIi I VS HI*P 
FGPTAYFTLCNWMRHDI PDLGTMSEAKPHL ITHGFS SRLGKRV 
SDILRYLFPVPKDDSHRVITBT^DDYISFRHHVYKKTDHRNVB 
LTEVGPRFBLKLYMIRIiGTI^QEAl'ADVKWRWHPY^ 
I»STB * AAPRPLGQLI* 


6173 


3 


288 


SVDHRBVQVLSQSMPLTPHQAVLRGBRPYMCVECGKCFGRSSHL 
LQHQRIHTGEBCPYVCSVOSKAFSQSSVLSKHRTIHTGEKPYECN 
ECGKAFRYSSDIAQHHKIHIGEKPHECLECRKAPTQI^HLIQHQ 
R I HTGERPYVCPLCG KAPNHSTVLRSHQRVHTGKKPHRCNECGK 
TFS VKRTLLQHQR THTGBKP YTCSBOGKAPSDRSVLIQHHNVHT 
GEKPYECSECGKTPSHRSTLMNHERIHTEBKPYACfYECGKAFVQ 
HSHIilQHQKVHRKL* PTCVLSVGSALAGVPTSFS IS VSTLERS P 
MCAVYVGRPSARAG^LVNTGQPTQVRSPMSVMSVEKPIiE 


6174 


1060 


959 


PRPPGKRWMVAGIjGNPGLPCTRHSVG^VI^ 
RDRHCAADIJU^LGDAQLVrJ,RPRRLMNANGRSVAPJVAELFGL 
TABBVYLVHDKLDKPU?RI^KLGGSAPX^GVRSCISCn^SNA 
MPRLRVG IGRPAHPEAVQAHVLGCFS PAEQKLLPLLLDRATDL I 
LDHIRERSQGPSLGP *H * WFSKKA 


6175 


2204 


334 


RYFPJUDPRSRSGQPRAEGI/5AFAEGPIiRAMAAPVKGNRKQSTEG 

EMJJPPASPKPAGKQNGIQNPISLEDSPEAGGBRBEBQERBBBQ 

AFLVSLYKFMKERHTPIERVPHIjGFKOII^WKIIYKAVEKIjGAYE 

LVTGRRLWKNVYNEI^GSPGSTSGATCTRRH Y* RLVLPYVlUn^ 

GBDDKPLPTS KPRKQYKMAKBNRGDDGATERPKKAKEERRMDQM 

MPGKTKADAADPAPLPSQKPPRNSTEOXX3IASGSSVSFVGASGC 

PEAYKRLLSSPYCKGTHGIMSPIAKKKU^VSKVEALQCQEEG 

CRHGAEPOASPAVHLPESPQSPKGLTENSRHRIjTPQEGIiQAPGG 

SLREEAQAGPCPAAPIPKGCPYTHPrEVLKPVSQHPRDFFSRIxK 

DGVIjU3PPGKEGLSVKEPQLVWGGDANRPSAFHKGGSRKGILYP 

KPKACWVSPMAKVPAESPTLPPTFPSSPGIiGSKRSIiEBBGAAHS 

GKRXRAVSPFIiKEADAKKCGAKPAGSGLVSM^ 

YRGTMLHCPIiNFTGTPGPLKGGAALPFSPLVI PAFPAHFLATAG 

PSPMAAGLMHFPPTSFDSAIJJHRLCPAfiSAWHAPPVTTYAAPHF 

PHLNTKL 


6176 


1040 


402 


PLSALRAMAEVHVIGQI IGASGFSESSIiFCKWGIHTGAAWKI*I>S 

GVREG<nX3ViyrPQIGDMAYWSHPIDI^ATKGLCK^PRI^FO^ 

SQDSFGRCQLAGYGFCHWSSPGTHQIACPTWRPICSWRBQl^ 

AFVGGGPQIrf^GDTIYSGADRYRLirnUVG^ 

RYGVEC*GriiPPTSPPSTPRTPSDGGG«HSGQBHRI* 


6177 


1400 


992 


VPIESLVGKVHNFPL21AFYCCEKGKRQPHKSLHDRCFGKALDPN 
CSHCYLDQI KRSDFLGFSGYS PHFVAISTNSBHKMQPSSMQQAL 
PSQ*PYWTDPRPALVPCCSHRPDVHRSRPGPGIiPGTSGCSDRPP 
VCPI 


6178 


1027 


254 


STGJRGGIKGVARAASLVGRRRAGTGMAIJJUCLVCLTAAIiAHGCI* 
HCHSNFSKKFSF YRHHVN FKS WWVGDI PVSGALLTDWSDDTMKE 
LHLAI PAKI TRKKUXJVATAVYQMMIX^^YQGKMYFPGYFPNELR 
WIFRBQVHLIQNAI I ESR 1 DCQHRCGI FQ YETI SCNNCTDSHVA 
CFGYNCESSAQWKSAV(^I^NYINNWHKOJ>TSMRPP^SAFSW 
THRAAPAFLVLPALRCIiEPimANIiSLBDAA*CLKQH 


6179 


806 


276 


RGETREMAGNrj^GAGRRLWDWVPIACRSFSLGVPRI, tGI RLTL 
PPPKVVDRWNEKRAMFGVYDWIGILGWFBKHPKELIRGPIWLRG 
WKGW.T^RCIRKRKMVGSRMFADDLHtn^KRIRYLYTG^FNRHGK 
FR*-KRKLRTS EKAHLS PWRRETVLPPVRKRIiCI FS VI KWGPPG I 


6180 


156 


1833 


DHHILKAASTTHVCARGNIFAI PNTRCLBC+ATATPSSLBCQN * 
SHLSLCPLPATTSGLTPNSMI PKKBRQNIAERLLRVMCADLGAL 
SWSGKE FbKLAQTLVDSGAR YGAFSVTE ILGhm^I JUiKHIiPR 
MYNQVKVKVTCALGSNACLG IGVTCHS Q S VGPDSCY I LTA YQAE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=* Alanine, C=Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, F^Phenyl alanine, G=Glycine, 
H=Hietidine, Isisoleucine, K^Lysine, 
L=Leucine , M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R^Arginine. 
S=Serine, T=Threonine, V- Valine, 
W^Tryptophan, Y^Tyrosine, XMJnknown, *-Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








GNHIKSYVLGVKGADIRDSGDLVHHWVQNVI^EFVMSEIRTVYV " 

TDCKVSTSAFSKAGMCXRCSACALNSWQSVLSKRTI.OARSMHE 

VIBLLNVCEDI^STGI^KETFGSLEOTSPPP 

HRRYEQICEPySRAKKMNL IQSLNKiDULSNLAAILTPVKQAVIB 

LSNESQPTLQI»VLPTYA/RI£KLFTAKANDAGT^ 

iUSW Ir kVMEAHKVAM I LDPQQKLR PVPPYQHEE X IG KVCELINE V 

KESWABBADFBPAAIOKPRSAAVENPAAQEDDRIiGKNEVYDYIiQE 

PLFQATPDLFQYWSCVTQKirrKLAKliAFWIu^^ 

MCEQALLI KRRR1*I»SPEDMNKLMFLKSNML 


6X81 


169 


1032 


TRTLI^PVLLPGPRWKCTireRRPMGPlJ^ 

YYLIQFCGHSWIFlllMTVRFFSFGKDSMVDTFYAIGLVMRLC^S 

VSIJ^LI^IYVGIBSHHLLPRFLQLTERIIILFVVITSQEBV^ 

KYWCVLFVFWNLLDMVRYTY3MLSVIGISYAV^ 

TVPTX^/Ti2Vlv AT?ZVTVA<Jr.PV r PRC17riTVQTV'T tjwt^t.ct wovirr v t 
a x tr ijv viKUUuiu x x r C2>J? i» l x«j iiviurr LJIjo X Irfx VLiRj. 

YXMMIiFIGMYFTYSHLYSERRDII^IFPIKKKKM*STAFOO 

KDRLWIQCSK*NTGSILVEKFLVF 


6182 


1769 


1224 


AS* IDYQl^TIiLKEFQLTEfiN'rKIiRYLTCSrjIEDMAAAYFPDCI 

VRPFGSSVNTFGKIX^IJ^FLDIJJBTRNI^A^ 

VKNVPSERlMQKII£VIX3Era^^ 

PftWflA tlTZ WClCTlT .TTTCNTI TaT.TQ<JPr.T.VTVnhT.nCD\roiiT tHTCiR} 
* onUnouffUUlul Xi^rviCX/VLiXooCiXkLiI X XljtfUjIJoKVAAXiVIroVK 

CWARAH3LTSS IPGAWITNFSLTMMVIFFLQRRSPPIIiPTLDSI* 
KTLlADAEDKCV^EGIWCTFVPJ)LSRIK^ 

YFGNFAFDKNS XNlRQGREQNKPDSSPLYIQNPFETSLNISKNV 
SQSQI^KFVDIiARF^WII^EDTDRPSISSNRPWGLVSLLLPS 
APNRKSFTKKKSNKFAI ETVKNLLES LKGNRTENFTKTSGKRT I 
STQT 


6183 


1118 


452 


HxJDRYIKSPGSGSSTPAPPSHLIiLYIJjHPQSTRTMGCCGCSRGC 
GSGCGGCG3SCGGCGSGCGGOGSGRGGCGSGCGGCSSSCGGCGS 
RCYVPVCCCKPVCSWVPACSCTSCGSCGGSKGGCGSCGGSKGGC 
GSCGCSQSSCCKPCCCSSG03SSCCQSSCCKPCCCQSSCCVPVC 
CX)SSCCKPCCCQSNCCVPVCCQCKI *GSGPRPSGFSCXVKAFLM 
VP 


6184 


1 


2191 


IVTVREEDGAPAVAPPGVVVSUANKRSGAGPGGSGGGGARGAEE 
EPPPPl^VLVADSFDRRFFPISIODQPR^I^IANVALIDYTLE 
PLTATGVQF/rFVFCCWKAAQlKEB^ 
BLYRSl^DVLRDVnAKALVRSDFlVLVYG^ 

LRRKL*KNVSVMTMIFKESS PSHPTRCHEENVVVAVI^TTNRVI. 
HFQKTO^LRRFAFPIiSLFO^SSDGVKVRYDIJ^C^ PQVA 
QI^TDNFDYQTRDDFVRGLLVNEE ILGNQIHMHVTAK^ 
NLHMYSAVOuOVIRRWVYPLTPEANFTDSTTXJSCmSRHNIYRG 
PBVSLGHGSILFJSNVl^GSGTVIGSNCFI^ 
NWLDQTTLWQGVRVAiVGAQIHOS^ 

TSQVVVGPNITIiPEGSVISLHPPDABEDBDDGKFSDDSGADQEK 
DKVKMKGYNPAEVGAAG KG YI/WKAAGMNMEEEEELQQNLWGLKI 
NMEBESESESEQSMI^EEPDSRGGSPOMDDIKVFQNEVLGTLQR 
GKEEN IS CDNLVLB INSLKYAYN I S LKKVMQVLSHWLEFPLQQ 
MDSPLDSSRYCAUiLPLLKAWS PVFRNYIKRAADHLEALAAI ED 
FFLFJ1RAI/3ISMAKVLMAFYQLEIIJVEETII^ 
QLRKNQQLQRFIQWIiKEAEEESSEDD 


618S 


791 


44 


PCTS CVLWATLHL P ASTKKAPQAECGM I S ITEWQKIGVGITGFG 
IFFILFtSmJYFDSVLLAFGNLI^TGLSLriGI^KTFWFFFQR 
HKLKGTSFLI^WIVIJjRWPIJ^MF^ 

GFLGNVCMI PFLGAkFRRLQGTSSMV* KTEMSSLNLDHWLKGAK 
REEWEPPPOS PALTHS PTYPGPPOVQKERNGAEQLTSNPQVDSR 
GCQEAE^QTPRRLGWGWYHTLTIjYLWBEK 


6186 


569 


238 


VYGIDS3NTTnTOAEERNRKiKKHWKI>CH^ 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

icolUUc OC 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to firet 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 

Alanine, C=Cysteine, D=*Aspartic Acid, E- 
Glutaraic Acid, F=Phenylalanine, G=Glycine, 
H=*Histidine, I^Isoleucine, K-hysinc, 
I»=Leucine, M«Methionine, N-Asparagine, 
P= Proline, Q=Glutatnine, R=»Arginine r 
S»=Serine, T=Threonine, V=Valine, 
WteTryptophan, Y- Tyrosine, X=Dnknown, *»Stop 
Codon, /=possible nucleotide deletion, 
\=>possible nucleotide insertion) 








KERKVKNKVKNKADTOEVFNNSPTt^^ 

NIRNQMETLHSQPHQEEinaCFBMSFSLlNLnPINAVEPTSSQQI 

PWRETSEANKERRKMTSKSSESNiySPLTSFTaADSHEJDilllCD 

LEIX^JWGK-nt^DlAPWTIiRI PTSNSE I KGVCSVGCCYHLLSR 

EPENQHKERTQEKHGFPMCHyi.KEERWCa3RNARMSACl^I^V 

AAGQGLPTESLFiT^VLQDIIKDCYGITKCI>RHVGKIYSKCS3P 

LDYVRRSLKKI/jLDKSKIjPBKI imnyyekykprmnblrafnmlk 

VVLAPCIETLILIJ>RI»CYIiKEQEDIAWSALV]^ 

VTALKKQQ*FPLKQIIRCISL*DSAGCAEEVSVGDGGPAIjRDAP 

PSGSRVGSRYD 


6187 


1701 


771 


DAWGPETR LARILNPD5 FI E PRPGRLPELEATRPHMEPKASCPA 
AAPLMERKFHVLVGVTGSVAALKL^ 

ERAKHFYS PQDIPVTLYSDADEWEMWKSRSDPVUIIDLRRSiADL 

LLVAPLDANTI^KVASGIOTNIJjTCVM^ 

AMWEHP ITAQQVDQLKAPGYVE I P CVAKiOWCGDEGLGAMAEVG 

Tl^DKVKBTOFQHSGFQQS *PGISVMGVP1»YSEWVQAKSVKMDV 

GKIGGYPHIiLNGGPALS LPRGQACSRIjqWTEGPGLS PFQPGEAA 

A 


6188 


238 


1534 


KGFV3)AGPIi*ABLQVSPQWKAPEMSQic^ 

NlGVPICIRC^rHRNIX^ISRVKSVNLIXJ^EQIQCMQEMG 

NGKAmi.YEAYLPETPRRPQIDPAVEGFIRDKYEKKlCYMDRSLD 

INAFRl^KllDKWKRGSEPVPEKKLEPWPBKVKMPQKKEDPQLP 

RKSS PKSTAPVMDLIiGIiDAPWCSlANSKTSNTLBKIJIiDLIiASV 

PSPSSSGSRKVVGSMPTAGSAGSVPSNLNLFPBPGSKSEEIGKK 

QI1SKDSII1SLYGSQTPQMPTQAMFMAPAQMAYPTAYPSFPGVTP 

PNSIMGS1WPPPVGMVAQPGASGMVAPMAMPAGYMGGMQASMMG 

VPNGM>TITQOAG YMAGMAAMPQTVY GVQPAQQI»QWNLTQMTQQM 

AGMNFYGANGMMMYGO^MSGGNEQAANQTLSPQMWK 


6189 


1297 


793 


IiGBPI^DLCELIPGDVQQLQMGEVHPGlXlAQGSAAQSVAGZVQL 

TQLSHARQRPS CQGS QLIAIaDLQHMDI SRQPRWQHVQP VARQVO 

RAQQAQIMGVAVHXWAGDAWAEVELrXJEVGGGKVFAANACDL 

WQDHEGA / HAAROATCHALQRVIVQnniRVOPI»KAL 

RVRAFMILHNQITGIGREJ>FATTYFIjEEL^^ 

AFRKLRLIjI^LDLSGIIRLHMLPPGLPP^ 

GALAGMAQLRELYl^TSNRljRSPJu^Xn>RAHVD 

LTB I PEGLPES LBYIiYLQlOOCISAVPANAFDS TPNLKGI FLRFN 

KlAVGSWDSAFIiPJtjRHbQVLDl^BGNLEFGDIS 

BEEEDEVEEEETR 


6190 


""' 66^ " 


J.JU9 


I bVGNV SFItUS FAE Y v CNCS V VGSLN VWRCNCH"1\5| < ^Ck* , ^*U 

GLHCBTCKBGFYIiNYTSGLCQPCDCS PHGALS T PCNTSSGKCQCK 

VGVIGSICERCftlXiYYGFSKNGCLPC^ 

CQENS K3NHCEECKEGFTQSPDATKEC1.RCPCSAVTSTC 

SffiELEPECDO^KDGY^GPNCinDCEWGYYKFDSI 

PVKTPKICKPBSGECINCIjHNTIXS k 

KVIXPTPEGSTILVSNASLTTSVPTPVINSTFTPTTIjQTIFSVS 

TSENSTSALADVSWTQFNI I ILTVI I IVVVLLMGFVGAVYMYRE 

YQimKLNAPFWTIELKEDNISFSS YHDS I PNADVSGLLEDDGHB 

VAPNGQLTLTTPIHNYKA 


6191 


1212 


1511 


VNLCMGGtXHLSTHHLGI KPSMH* LFFl^SF^HLTPQQPKCPS 
MIDW I KKI WY I xTMEYYATl KRNB IMF FAGTWMEM EAI I LS KLH 
QDYMFSLISGS 


6192 


3 


950 


TRGCGNKMAGKlQSfVIjSSIiAVYAEDSE 

KGGLVS DAYGEDDFS RLGGDEDGYEB EEDENSRQS eddds ete k 
PEADDPKDNTEAEKRDPOEliVASFSBRVPJMSTOElKIPPE 
RCSNHLQDKIQKLYERKI KEGMDMNYI IQRKKEFRNPS I YEKL I 
QPCAIDELGTNYPKDMFI>PHGWSE1^YYEALAKAQKIEWDK1 
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S3Q 
ID 
NO: 


Predicted, 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AeAlanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F*»Phenylalanine, G=Glycine, 
H=Histidine, I=»Isoleucine, Kafcysine, 
L= Leucine, K»Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S-Serine, T=sThreonine, V-Valine, 
W= Tryptophan, Y=Tyrt>sine, X= Unknown, +«Stop 
Codon, /=pos8ible nucleotide deletion, 
\=possible nucleotide insertion) 








AKKE RTK I E FVTQTKKGTTTN ATS TTTTTAS TAVADAQKRKS KW 
DSAI PVTTIAQpT I LTTTATIjPAVVTVTTSASGS KTTVI SAVGT 
IVKKAKQ 


6X93 


3 


950 


TRtK^KMAGKKNVLSSIAVYAEDSSPESPGBAGIEAVGSAAEE 

KGGLVSDAYGEDDFSRLGGDEDGYB3BEDENSRQSBDDDS3TBK 

PEADDPKDNTEABKRDPQELVASFSERVRKMSPDEIKIPPEPPG 

RCSNHLQDKI QKLYBRKIXEGMDMNYI IQRKKRPRNPSI YEKLI 

QFCAIDEIiG^NYPKDMFDPHGWSBDSYYEAIiAK^ 

AKKERTKIE FVTGTKKGTTTNATS TTTTTASTAVADAQKRKS KW 

DSAIFVTTIAQPTILTTTATIiPAVVTVTTSRSG^ 

IVKKAKQ 


6X94 


3 


950 


TRGCX3NKMAGKXNVLSSIJVVYAEDSEPESEXiEAGIEAVGSAAKE 

KGGIiVSmYGBDDFSRLGGDBDGYEBEEDENSRQSEDDDSETEK 

PEADDPKDNTEAEKRDPQELVASFSERVRNMSPDEIKIPPEPPG 

RCSNHLQDKIQKLYERKI KBGMDMNY 1 1 QRKKEFRNPS I YE KLT 

QPCAIDEX^TNYPKDMFDPHGWSBDSYYEALAKAQOEMDKLEK 

AiOCERTKlEFVTGTKXGTTTNATSTTrTTASTAVADAQKRKS^ 

DSAIPVTTIAQPTILTTTATLPAVVTVTTSAS^ 

IVKKAKQ 


6X95 


736 




VANGI^SNMPRFYCXWCIHTI/iro 

YYQK«MKBQAOSLIDKTTAAPXJQGKIPPTpFSAPPPAGAMIPPP 
PSLPGPPRPGMMPAPHMGOPP^PMMGPPPPGMMPVGPAPGMRP 
P>K5GHMPMMPGPPMMRPPARPMMVPTRPGMTRPDR 


6X96 


1512 


623 


KTGKRRS AAYVRN ILDNAEQVISNLKARNLC^RLTPLliQEEDSH 
QRiliMGIiMVSELICDHFLRHIjQCT 

HIVBTNvmKHNI^$W\aiHFN3RGSAAEFAVFHIMTRlLEA'rNSL 
FLPIiPPGFHTLKTILGVQCLPliHNLI^ 

MKDI^DNTEKNBiaKFSIIVRLPPLIGQKICRLWDHPMSSNIISR 
NHVTRIJ^NYKKQPRNSMINXSSFSVEFLPLNYFIE ILTDI ES S 
NQALYPFEGHDNVDAEFWEAAIiKHTAMLLGL 


6197 


3 


8X9 


ADP^TEBUVVMSRYTRPPMTSliFIRNVADATRPBDLRREFGRYG 

PIVDVYIP^FYTRRPRGFAYVQFEDVRDAEI^YWLNRKWV 

RQIKIQFAQGDRKTPGQMKSKERHPCSPSDHRRSRSPSQRRTRS 

RSSSWGRNRRRSDSLKESRHRRFSY6QSKSRSKSLPRRSTSARQ 

SRXP1UWFGSRGRSRSKSLQKR5KSIGKSQSSSPQKQT8SGTKS 

RSHGRHSD8 IARSPCKSPKGYTNFETKVO/TAKHSHFRSHSRSRS 

YRHKN3W 


6193 


ill 


" X912 


SKAAI^PSFISPACFI^RiQ^PALEDGTLPHPM'LGMNYBGARSB 

RENHAADDSEGGALDMCCSERI.PGriPQP IVMPALDEAEGLQDSQ 

REMPPPPPPSPPSDPAQKPPPRGAGSHSLTVRSSLCLFAASQFIj 

IJtfX^WFSGYGHIWS<^TNI»VSSIXTI»^^ 

GVPSIJJiVFl^^LVLVTTLVWHIJiRTPPBPPTPLPPEDRRQSV 

SRQPSFTySEWMBEKIEt>DFIjDliDPVPETpVFI>CVMDIKPEADP 

TSIiTVKSMGLQERRGSNVSLTLDMCTPGCNEEGFGYI/ISPREBS 

AREYXLSASRVtiO^ELHEKAIJJPFIXQAEFFEIPMNFVDPKEY 

DIPGLVRKWYKTILPNPH3RVCLTSPDPDDPLSSYINANYIRG 

YGGEEKVYI ATQGPT VS1*VADFWRMVWQEHTP I IVKITNI EEMN 

EKCT^TWPEEQVAYBGVBITVQKVIHTEDYRLRIjISLKSGTEER 

GL1CHYWPTSHPDQKTPDRAPPLI^VR^VBBAA<XJEGPHCAPII 

VHCSAGIGRTXICFIATSICCOQLRQEGVVDILKTrCQIiRODRGG 

MIQHCEQYQFVHHVMSIiYEKQIiSHQS PE 


6199 


144 


X211 


MARENGESSSSWKKOJ^IKKIFEFKETI^lGAFSEWIJ^ 

tgklfavkci pkkalkgkessieneiavtj^khenivalrdi y 
bs pnhlylvmqlvsggelfdr i ve kgfytekdastklrqvldav 
yylihr^ivhrdlkpenixyysqdeeskimisdfgi*skmegkgd 

VMSTAC^PGYVAPBVLAQKPYSKAVDCWS IGVTAYiriLCGYPP 
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SEQ 
ID 
NO: 


Predicted 

Kan i Itn "1 Tin 

nucleotide 
location 
corresponding 
to first 

cfUU.1117 dLlU 

residue of 
amino acid 
sequence 


Predicted end 

location 
corresponding 
to first 
amino acid 

amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=phenylalanine, G-Glycine, 
H=Histidine, I^Isoleucine, K=Iiysine r 
L=:I*eucine, M-Methionine, N^^Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S = Serine , T=Thireonine , V=Valine # 
W oTryp t ophan , Y=Tyrosine, X=?Unknown, *~Stop 
Codon, /=possible nucleotide deletion, 
\=»possible nucleotide insertion) 








FYDENDSKLFEQII»KAEYBFI>SPYWDDISDSAKDFIRNIjMEKDP 
NKRYTCEQAARHPWXAGDTALNKNIHESVSAQIRKNFAKSKWRQ 
AFNATAVVRHMRIOjHLGSSI^SSNASVSSSI>SIiASQKIX3lSGTP 
HAL* 


6200 


702 


96 

1 


LPEVPHSLRPRVKPHLCX^VQPAVRVMARIiPKIiAVFDIiDYTLWPF 
WVDTHVDPPFHKSSDGTVRDRRGQIWRXiiPEVPEVL^ 
PGAAASRTSBIEGANQLIiBIjFDIiFRYFVHREIYPGSKITHFERL 
QQKTGI PFSQMI FFDDERRW I VDVSKIK3VTCIHIQNGMMLQTLS 
QGLETFAKAQTGPIiRSSLEBS FFK A 


6201 


2809 


2383 


GQTPRVRWKMRRSIiHAGKPJIQTAGRKSKSPPKVPlVIQDDSLPA 
GPPPQIRILKRPTSNGWSSPNST5RPTLPVKSLAQREAEYAKA 
RKRILGSASPBERQEKPILDRPTRISQPBDSRQPNNVTRQPLGP 
DGSQGFKQRR 


6202 


2 


426 


INADRAAVASS LiLS RPTRKMAPOKDR KPKRSTWRFNItDbTH FVK 
PG I FDSGNFEQFLREKVKVKGHTGNI/5NVVH IKRFKNXITVVS E 
KQFSKRYIJCYLTKKYLKKNNIJ^DmjiVVASDKETYBLRyFQISQ 
DEDESESED 


6203 


419 


2550 


RCPRPPATAGAAASRPDRS PPSG ISGSEAAAGAQAAAPASQHPA 
TG1*GAVQTEAMKQ I LGV I DKKLRNIjEKKKGKLlDD YQBRM N KGER 
LNQDQLDAVS KYQBVTNNLBFAKBLQRSFMALSQDIQKTI KKTA 
RREQLMRBBAEQKRIiKTVLEX&YVXjD 

PILSEEELSLLDEFYKLVDPBRDMSLRLNKQYEHASIHLWDIiLE 
GKEKPVCXJTTYKVLKEIVERVFQSNYFDSTHNHQNGLCEEEEAA 
SAPAVEDQVPEAEPEPAEBYTEQSEVESTBYVNRQPMAETQFTS 
GEKEQVDEWTVETVBWNSLQQQPQAASPSVPEPHSLTPVAQAD 
PLVRRQRVQDLMAQMQGPYl^IQDSMIiDFENQTIiDPAlVSAQPM 
NPTQNMDMPQLVCPPVHSESRlAQPNQVPVQPEATQVPIiVSSTS 
EGYTASgPLYQPSHATEQRPQKEPIDQIQATISLNTDQTTASSS 
LPAASQPQ V FQAOTS K PLHSSG I NVNAAPFQS MtfTVFKMNAPVP 
PVNBPEIl>K(X>NQYQASYNQSFSSQPHQVECrrELQQEQI<rrVVG 
TYHGSPDQSHQVTG1WQQPPQONTGFPRSNQPYYNSRGVSRGGS 
RGARGLMNGYRGPANGFRGGYDGYRPSFSNTPNSGYTQSQFSAP 
RDYSGYQRDGYQQNFKRGSGQSGPRGAPRGRGGPPRPNRGMPQM 
NTQQVN 


6204 


2933 


787 


CTHNLISLIX^RAI^HFNRFIJ«^IQEGBAHNIPCPAYDCFQIiV 
PGDI IKBWSKEMDKRYLQFDIKAFVENNPAlKWCPTPGCDRAV 
RLTKQGSNTSGSDTIjS FPLLRAPAVDCGKGHLFCWECLGEAHE p 
CDCO/TWK1WLQKI TEMKPBELVG VS KAYKDAAN CLWLLTNSKPC 
ANCj^PIQKNI^iCNIiMQCAKCKYDFCWICLEEWKKHSFVHHKVl 
YRCTRYEVIQHVEEOSKEMTVEAEKKHKRFQEI^DRPM^ 
NHEHSYQLBQRIJ>KTAKRKMRQI^RAI*^ 
HVLLKTRRILKCS YP YGFFLEPKSTKKK I FELMQTDLEMVTEDL 
AQKVNRP YLRTPRHKI IKAACLVQQKRQEFIA3 VARGVAPADS P 
EAPRRS FAGGTWDWBYIjGFAS pbeyaefoyrrriirqrrrgdvhs 
LLSNPPDPDEPSESTLDIPEGGSSSRRPGTSVVSSASMSVLHSS 
Sr J RDYTPASRSRNQD3LQAI J SSLDBDDPNILIAIQLSLQESGLA 
LDEETRDFI^NKASLGAIGTSI»PSRI.DSVPRNTDSPRAALSSSE 
LIjEXGDSLMRI/3AENDPPSTDTLSSKPLSEARSDFCPSSSDP0S 
AGQDPNINDl^LGNIMAWFHDMNPQS IALI PPATTEI SADSQLP 
CIFOX;SEGVKUVELVLPEDSMFEDA3V3EGRGTQlEENPIiEENI 
PGGGKQHPQAW 


6205 


1 


1200 


rahrgkmalevgdmedgqls ds ds dmtva ps drplqii pkvlggd 
samrafqntaiacapvshyravesvdssees fsdsdddsclwkr 
krqkcfnpppkpbpfqfgossqkppvaggkkinniwgavlqbqn 

QDAVATELGIIXJMEGTIDRSRQSETYNYt^ 
IJ)KELDBYMRGGKK>K3SKEEBNGQGIlI^RKRPVia)RriGNRPEMtf 
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Amino acid segment containing signal peptide 
(A=*Alanine, C=»Cysteine / D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H==Histidine, I^Isoleucine, K=Lysine, 
L=*Leucine, M=Methionine, N-Asparagine, 
P=Proline, Q=Glutamine, R»Arginine, 
S=Serine, T=Threonine , V«Valine, 
W^Tryptophan , Y«Tyrosine f X=Unknown, *=Stop 
Codon, /=posaible nucleotide deletion, 
\=pos3ible nucleotide insertion) 








Y KGR YE ITAEDSQK KVADK IS FRLQE PKKDLIARWR1 IGNKKA 
I3I^ETABVEC^GGLFIMNGSRI^TPGGVFIJSIIiraTrPSISEE 
Q I KD I FYI ENQKK YKNKKAARKRRTQVLGKKMKQAI KSLNFQKD 
DDTSRETFASiyrNEALASIJ5ES0EGHAEAKLEAEEA I EVDHSHD 
LDXF 


6206 


10 


1442 


IISERRERSC^LVCIRCSCDVVl^SVI^U^MASWIPCLCGS 
APC^CRCCPSGNNSTVTRIjIYAI^LLVGV<^ 
LNKIPGFCKNEKGVVPCNIIiVGYKAVY^LCTOLA^ 
IKTsAKSSSDPRAAVHNGFWFFXFAAAIAIIIGAPPIPEGTFTTVW 

fyvgmagafcfiliqlvllidfahsknesnvekmeegnsrcwya 
allsataijmilslv7u:vlpfvyythpascsbnkapi svnmllc 
VGASVMS ilpkiqesqprsgli^ssvitvytmyltwsamtnepe 

TNCNPSLI^I IGYNTTSTVPKEGQSVQWHHAQGI IGLIIiFLI^TV 
FYS 3IRTSNNSQVNIO»TLTSDESTLIBDGGARSDGSL»EDGDDVH 
RAVDNKRIXJVTYSYSFFHFMIiFIiA^ 
KSQWTAVWVKISSSWIGIVLYVWTLVAPLVLTNRDPD 


6207 


2924 


1471 


1'VMAKAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAG 

GGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDbSDSPY 

SWCKYFQRGYCIYGDRCRYEHSKPLKQEEATATBLTTKSSLAA 

SSSLSS I VGPLVEMNTGEAESRNSNraTVGAGSEDWVNAI EFVP 

GQPYC»RTAPSCTBAPLQGSVTKB3SEKEOTA 

VGECRYGENCVYLHGDSCDMCGliQVIillPMDAAQRSQHI KSCIBA 

HEKDMELS FAVQRS KDMVOG I CHnEVVYEKANPSERRF^II^NCN 

HTY C1.KCI RKWRSAKQ FES K 1 1 KS CPECRI TSNFV1 PSEYWVBE 

KEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCPYKHAYPD 

GRRESPOROKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEE 

VV^FSIjGEMI^LIAAGGDDELTDSEDEWDLFHDELEDFYDLDI* 


$208 


2924 


1471 


TVMAKAATPGTTATTSGAGAAAATAAAAS PTP I PTVTAPSLGAG 

GGGGGSIX^SGGGin'KQVTC^Fl^GVCKEGDNCRYSHDLSDSPY 

SWCXYFQRGYCI YGDRCRYEHS KPLKQBEATATELTTKSSLAA 

SSSLSSIVGPI,VEMNTGRAESRNSNPATUGAGSFJ3WVNAXF^VP 

GQPYCGRTAPSCTFJU>I^SVTKEESEI^(^AVF/na^LCPYAA 

VGECRYGENCVYLHGDS CDMCX5LQVLHPMDAAQRSQHI KS CIEA 

HEKDMEI^ FAVQRSKI^VXXSI CMEVVYEJKANPSERRFG I L3NCN 

HTYCLKCIRKWRSi^FESKIIKSCPEailTSNFVIPSEYWVEE 

KEEKQKLIliKYKEAMSNKACRYFDEGRjGSCPFGGNCFYl^^ 

GRRi^PO^KVGTSSRYRAQRRNHFWKLIEERENSNPFDNDEEE 

VVTFELGEM1iLMIJjAAGGDDEIjTDSED5WDLFHDFXEDFYDIJ5L 


6209 


■ 1750 


829 


ERLCFPCMQS KI YSYMSPNKCSGMRFPLQEBNSVXHHB VKCQGK 
PLAG I YRKREE K31NAGNAVRSAMKS EBQKT KBARKGPLVP FPNQ 
KS BAAE PPKTPP SSCDSTNAAIAKQAIiKKPI KGKQAPRKKAQGK 
TQQNRKLTDFYPVRRS SRKS KABIiQS KERXR IDELIESGKE EGM 
KIDLIIXJKGRGVXATKQFSRGDFVVEYIIGDLIEITDAKKREAIjY 
AQDPSTGCYMYYFQYLSKTYCVDATRETNRIjGR^ 
TKLHDIDGVPHLIIiIASRDIAAGRBLLYDYGDRSKASIKAHPWL 
KH 


6210 


3761 


387 


ik^skij^mvlledsgsjujfrrhfvnlspftitvvlli^cfvt 
sslggtdkex,rlvdgenkcsgrvevkvqef^gtvomgwsmeav 
svicnqlgcptai kapgwans s agsgriwmdhvs crgjjesalwd 
ckhdgwgkhsncthgx)dagvtcsdg snlbmrltrggnmcs gr i e 

IKFQGRWGTVCDDNFNIDHAS VICRQLECGSAVS FSGSSNPGEG 
SGP I WFDDLI CNGNE SALt^CKHQG WGKHNCDHABD AGVI CS KG 
ADI>SI»RLVDGVTECSGRLRVR FQGEWGTICDDGWDS YDAAVACK 
QLGCPTAVTAIGRVNASKGFGHI WLDS VS CQGHE PAVWQCKHHE 
WGKHYOmNBDAGVTCSDGSDLEIiRI^GGGSRCAG^ 
I^KVCDRGWGLKEADVVCRQLGCGSALKTSYQVYSKIQATNTWI. 
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Amino acid segment containing signal peptide 
(A=Alanine, C«Cysteine, D=Aspartic Acid, B=s 
Glutamic Acid, F=phenyl alanine, G-Glycine , 
H=Histidine, I^Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
ps=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T^Threonine, V=oValine, 
W =Tryp t ophan , YaTyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FLSSCNGNETTSLWDCKNWQWGGLTCDHYEEAKITCSAHREPRLV 
GGDIPCSGRVEVKHGDTWGS ICDSDFSLKAASVLC^ELQCGTVV 
S IUSGAHrcBGNGQIWAEEFQCEGHESHI^I^CPVAPRPHOTCSH 
SRDVGVVCSRYTBIRLVNGKTPCEGSVELKTIjGAWG 
IEDAHVLCQQLKCGVALSTPGGARK3 

HMGDCPVTAI^ASLCPSEQYASVICSGNQSQ'rLSSCNSSSLGPT 
RPTI PBESAVACI ESGQLRLVHGGGRCAGRVBI YHEGSWGTI CD 
DSWDLSIlAHW^QLGCGKAINATGSAHreEGTGPIWI^EMKCK 
GKESRIWQCHSHGWGQ^CRHKKDAGVICSEFMSLRLTSEASRB 
ACAGRLEVFYNGAWGTVGKS SMS ETTVG WCRQLGCADKGKINP 
ASLDKAMSIPMWVDNVQCPKGPDTLWQCPSSPWBKKLASPSBBT 
W ITCDNKI RLQEG PTS CSGRVE I WHGGSWG WCDDS WDLDDAQV 
VC^DGCGPAIJaFKEAEFGQGTGPIWLNWKCKGNESSIjWDCP 
ARRWGHSECGHKKnAAVNCTOISVQKTPQKATTGRSSRCSSFIA 
VGILGVVLIAIFVALFFLTKKRRQRQRIAVSSRGKNLVHQIQYR 
EMNSCLNADDLDLMNS SGGKSE PH 


6211 


3761 


367 


I FGMSKLRMVLLEDSGSADFRRHFVNLSPFTITWLLLSACFVT 

SVICNQLGCPTAIKAPGWANSSAGSGRIWMDIIV3CR0NESALWD 
CKHDGWGKHSNCTHQQnAQVTCSuGSNI^ 

I KFO^WGTVCDDNFNIDHASVICRQLECGSAVSFSGSSNFGEG 

SGPIWFDDLICNGNESALWNCKHQGWGKHNCDHAEIW3 

ADLSLRLVT)GVTECSGRI>EVIo7QGBWGTIOT 

QLGCPTAWAIGRYNASKGFGHIWI^SVSCQGH^PAWQCKHHE 

WGKHYCNHNKDAiJVTCSIXSSDLEXRLRGGGSRCAGTVEVEIQR^ 

I^KVCDRGWGUCEADWCRQI^CGSAIJCTSYOWSKIQATNTWL 

FLS S CNGNETS I>WDCKNWQ WGGLT CDHYEEAK I TC3 AHRE PRLV 

GGDI PCSGRVEVKHGDTWGS IO^DFSIiFAAS V^CREIjQCGTVV 

SIKJGAHFGECWGQIWABEFQCEG^SHLSLCPVAPRPEGTCSH 

RRDVGWC^RYTEIRLViraKTPCFX5RVEIjiaXGANGSr*CNSHWD 

IETDAHVIXX2QLKCG VADSTPGGARFG KGNGQ I WRHMFH CTGTEQ 

E^IX!PVTAW5ASL(^SEQVASVICSGNQ^G/IT5SC1^SSLGPT 

RPTI PEBSAVACI ESGQIiRLVNGGGRCAGRVBIYHEGSWGTICD 

DSWDI^DABVVCRQIiGCGBAINATG5AHFGEGTGPIWLDBMKCN 

GKESRrwO^IHSHGWGQONCRHlGSDAGVlCSBFM 

ACAGRIiEVFYNGAWGTVGKS SMSBTTVGWC^ 

ASU3KAMSIPMWVDNVQCPKGPDTI.WQCPSSPWBKIUiASPSBKT 

WITCDNKIRLQEGPTS CSGRVB IWHGGSWGTVCDDSWDLDDAQV 

VCQQLGCGPALKAFKKABFGQGTGP IWLNBVKCKGNESSLWDCP 

ARRWGHSECGHKKDAAVN CTDISVQKTPQKATTGRS SRQSS FIA 

VGILGVVLLAIFVALFFLTKKRRQRQR1AVSSRGENLVHQIQYR 

EMNSCLNADDLDLMNSSGGHSEPH 


6212 


1 


1134 


LKWEI^PGGAVWGTGRGAGTGAPRSCCCQTNPGPPSSLRRAFRR 
RBLPFPACHEIGLGABAGSGPPPAPAARESRSRAMEBBASSPGI, 
GCSKPHJUBKJj i JUG 1 ITvlJUKoo JrviV X K VI X XnR.tr riUSKiinXoo HH 
QKlWCVMPEDvTCNFYljMIiraFHim IPLGSMAINSI 
SKLTQLTQSSI4YSLPNAPTLADLEE5DTEEASDDQPEKPHFDSRS 
VI FBI JDS CNGSGKVCLVYKSGKPAlAED*rEI WFLDRALYWHFLT 
DTFTAYY RJ^ITHLGLPQWQYAFTSYGIS PQAKQRVSMYKP ITY 
NTNIiLTBETDSFVNKLDPS KV FKS KNKI VT PKKKGPVQP AGG QK 
GPSGPSGPSTSSTSKSSSGSGtfPTRK 


6213 


1 


1134 


LKWFJjRPGGAWGTGRGAGT^APRSCCCQTNPGPPSSLRRAFRR 
RELPFPACHB IGlJGABAGSGPPPAPAARBSRSRAMBBBASSPGL 
GCSKPHLEKLTUSITRII^SPGVTFAn'IIEKPPABliHMISSWE 
QK^CVMPBDVia^YIJCENGFBOT 

SKLTQLTQSSrfySLPNAPTLADIiEDDTHPJ^DDQPEKPHFDSRS 



467 



WO 01/53312 



PCT/US00/34263 



SEQ 

TO 

NO: 


Predicted 

Kfvf ■) nn i net 
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sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=*Aspartic Acid, 
Glutamic Acid, F»Phenylalanine, G^Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
I^Leucine , M=Methionine , N-Asparagine , 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, TsThreonine, v=valine, 
W^Tryptophan, YoTyrosine, X=Unknown, *=Sbop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








VI FELDS CNQSGKVCLVYKSGKPALABDTB I WPLDRALYWHFLT 
DTFTAYYMOiITaLGLPQWQYAPTSYGISPQAKQRVSMYKPITY 
tTTNIiLTBETDSFVNKIiDPSKVFKS KNKTVI PKKKGPVQPAGGQ K 
GPSG?SGPSTSSTSKSSSGSGNPTRK 


6214 


2 


460 


HKIAPSAIRRAARLGLGPARWQSRAAAFY FVRGFRTGWS PVGWV 
VW!WSAKRT1^FFFLSKMAASSRAQV^ 

YRTYAVRRIRDAFRENKNVKDPVB I QTLVNKAKRDLGVIRRQVH 
IGQLY3TDKLI IENRDMPRT 


6215 


2 


1849 


FVAGG PRGSGSAABTM PK I RVTPLGAGQDVGRS C I L»VS 1AGKNV 
MLDCX3HHMGPITDDRRFPDFSYITQNGRI»TJJFLDC^^ 
a?ALPYFSEMVGYIX:J>IYtmiPTGAICPlLLED^ 
ANFFTSQMIKDCMKKVVAVHIiHQ^ 

YATTIRDSKRCREHDFIiKKVHETVroGGK^IP 
IIJOTFHBRMI«iKVPIYFSTGl.TBKANHYxK^ 
FVQRNMFBFXHIKAFDRAFADNICPWV^^ 
RKHAGNEKNIWIM PGYC^QGTVGHIG; I»SGQRKIiEMKGRQVLEVK 
MQ VKYMS FSAHADAKGI MQ LVGQAE P KS VIJjVHGEAKKME FLKQ 
KIBQEIJlWCYMPANGBTVTLPTSPSIPVGISIiGlji^^ 
LPEAKKPRIiLHGTLIMKDSNimLVSSEOAL^ 
RVKLHDTPJCBQETAIiKVYS HLKSVL in)HCVQHLPDG3 VTVES VIj 

t />>\ > nOT3r\n/"nniT T Trci«rT»VyM"Yl?OT Y^CTTT.TQT.T.VTfOTiUOTVOQ 
liUAAAIra KlJ t*lal JtVijJ-iVoW J. lUl/EXSlArol: uAoululAUJJJtrVfUr£> 


6216 


11 


393 


QTTRPEPRNSALRQSRSKMAVVGVSSVSRriIjGR5RPQLGRPMSS 
GAHGEEGSARMWKTLTFFVALPGVAVSMIiNVYLKSHHGKHERP 

PTHVOUT dtetvd u W~"H\l r T*T .T?HTCD1TUTtfT?T .UTYiY PTTR 

r £A i PiiLwC JL KTAr r irrttjlAjMn X lirnx* trtxvrii? ±jf iwtuo 


6217 


9 


1178 


TRVGRGESGLKMSVKPPPGRPQPDSQRRRRRRGeEGHPPKEPEQ 
IJtKLFIGGI^FKTTDDSLREHFEKWGTLTIKVVMRDPQTKRSRG 
PGFVTYSC^BVOAAMC^PHKVDGRVVEPKPvAVSREDSVKPGA 
HLTVKKI PVGG I KKDTEB YNLRDYFBKYGKIBTIBVMEDRQSGK 
KRGFAFVTFDDHDTVDKI VVQ OHTIl^GHNCEVKKAl»SKQEMQS 
AGSQRGRGGGSGHFMGRGGNFGGGGGWFGRGGNFGGRGGYGGGG 

IjVJOX\nJ^ X UvwIA^IHUf wUJVUUillvwwcvi jjivuui *Jvvvrv J- vj 

NQGGGYGGGGGYIX5YNEGGNFGGGNYGGGGNYNDFGNYSGQQQS 
NYGPMKGGSFGGRSfeGSPYGGGYGSGGGSGGYGSRRF 


6218 


130S 


906 


S CERR^FI MADDIJCRFLYKKL PS V^GIiHAIVVSDRDGVPVI KVA 
NDNAPBHALRPGFLSTFAIATDQGSKIiGLSKNKSI I CYYNTYQV 
VQFNRLPLWSFIAS S SANTGLIVSLE KELAPIiFEEkRQVVEVS 


6219 


2 


890 


AGPGEGAGAGTRCAGAEAEMASAGGEDCBSPAPEADRPHQRPFI. 

IGV^GGTASGKSTVCBKTMELlJGQNEVRQRQRKVVIIxSQDRFYK 

Vr.TABQKAKALKGQYNFDHPDAFr^UOmTIJailVEGKTVEVP 

TYDFVTHSRLPF/ITVVYPADVVXFEGILVFYSQBIRDMF 

VDTDSDVRI^RRVXRDVl^GRDLEQIIiTQYTTFVIU?AF5BPC^ 

TKKYADVI I PRGVT)NM\^INLIVQHIQDIIjNGDICKWHRGGSNG 

RS YKRTFSE PGDHPGMLTSGKRSHIiESSSRPH 


6220 


227 


764 


BQNISLEMSCTIKKAI^AKALVFmJ^HDDAAF^LIEQTTAIJ^ 
WRVEAMKQ YQKBlQKbNEVARHRPRSTI»VMG IQQENRQIRELQQ 
EN KKLRTS LEEHQS ALELIMS KYREQMFRULMASKKDDPGI IMK 
IJCEOHSKIDMWRNKSEGFFIaTASRHILEAP^ 
NVH 


6221 


98 


916 


RWIWDI^PVSIXBLBIiRPKVNGII^Cl.TriWKLJXniRGLYQGVTP 
HIWGAGI^WGLYFVFYNAIKSYKTEGlUlERLFJVrEYIiVSAAEAG 
AMTIjCITNPLWUTKTRI^I^YDAVVNSPHRQYKGMPDTLVKIYR: 
YEGVRG L Y1CGFVPGLFGTSHGALQFMAYELLKLKYKQHI NRLP E 
AQLSTVE Y I S VAAI^ KI FAVAATYP YGVVTIARLQDG^IMFY SGVI 
DVITKTWRKKGVGGFYKGIAPNLIRVTPACCITFVVYKNVSIIFTj 
LDLREKRK 
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Amino acid segment containing signal peptide 
(JWUanine, C^Cysteine, D-Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G= Glycine, 
H=Histidine, I=*Isoleucine, IfcLysine, 
L=ijeucine # M=Methionine, H^ASparagine, 
P=Proline, Q=Glutam±ne, R=Arginine, 
S=sSer lne, T^Threonine, V-Valine, 
W=Tryptophan , Y=Tyro3ine, X=Unknovro, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6222 


2 


2116 


MAREIJ?AIiLLWGRRIiKPLLRAPALAAVP^ 
PRRNPAWSLQAGRI*FSTQTAKDECEEPI»HS I ISSTES VQGSTS KH 
EF^ABTKKLLDIVARSLYSEKEVFIRBLISNASDALRKI>RHKLV 
SDGQALPEMEIHLQTNAEKGTITIQDTGIGMTOEELVSNLGTIA 
RSGS KAPLDALQNQAEASSKI I GQFGVGPYSAFMVADRVEVYSR 
SAAPGSLGYQWLSDGSGVFE IABASGVRTGTKI I IHLKSDCKKF 
SSEARVRDVVTKYSNFVSFPLYLNGRRMNTI^ 
WQHEEPYKYV7\QAHDKPRYTLHYKTDAPLWrRS I FYVPDMKPSM 
FDVSREI^SSVALYSRKVLIO^KATDILPKWI^FIRGVVD^ 
PIJiriSREiaXJBSALIRKUUXTOXJRLIKFFIDQSKim^ 
FKDYGIiFMR3GIVTATEQEVKEDIAKLLRYESSALPSGQLTSLS 
E YASRMRAGTR1JI YYLCAPWRH1AEHS PYYKAMKKKDTEVLFCF 
EQFDEIjTLIJnjREFDKlCKLI SVETDIVVDHYKEBKFEDRSPAAE 
C^EKBTBELI4AWMRNVM^RVTNVKV^^ 
AARHPliRMQQLAKTQEERAQItLQPTLE IKPRHALIKKLNQLRAS 
EPGIiAQI^VDQIYENAMIAAGLVDDPR^ 


6223 


3 


715 


DAWARTMAGMVDFQDEEQVKS FLENI4EVECNYHC YHEKD PDGCY 
RIjVDYLEGIRKNFDEAAKVLKFNCEENQHSDSCYKLGAYYVTGK 
GGLTQDLKAAARCFLMACKKPGKKS IAACHNVGLLAHDGQVNBD 
GQPDLGKARDYYTRACDGGYTSS CFNLSAMFLQGAPGFPKDMDL 
ACKYSMKACDLGHIWACftNASRMYKIjG^ 
QQVHKEQQKGVQPLTFG 


6224 


1 


j 133 


LRTI SSMAWG PI^LTLLRHCTGS WAQSVLTQPPS VSGAR I PHBK 


6225 


3259 


938 


IJ^aRIiAICKLiPFSVESRXTVMGPQGARKQAFLAFGDVTVDFT 

QlffiWRIjLSPAQRAIiYREVTLENYSHLVSLGIIJISKPBLIRRLEQ 

GEVPWGEERRRRPGPCAGIYAEHVLHPKNI/njAHQRQQQLQPSD 

QSFX3SDTAEGQEKEKSTKPMAFSSPPLRHAVSSRRRNSVVEIES 

SQGQRBNPTE IDKVLKG I ENS RWGA FKCAERGQDFSRKMMVI IH 

KKAHSRQKIiFTCREaiQGFRDE3.ALIiIjHQIfIB 

RGFSLKANLLRHQRTHSGBKPFbCKVCGRG 

TGBKPYECQECGRRFNDKSSYNKHLKAHSGEKP FVCKBCGRGYT 

NKSYFVVHKRIHSGEKPYRCQECGRGFSNKSHLITHQRTHSGEK 

PFACRQCKQS FSVKGSMJ^QRTHSGEKPFVCKDCERSFSQICST 

LVYHQRTHSGEKPFVCRECGQG FIQKS TLVKHQ I THSBE KPFVC 

KDCGRGFIQJCSTFTLHQRTHSBEKPYGCRBaSPJIFRDKSSYNKH 

IJIAHLGEKRFFCRDCGRG FTUKPNLT IHQRTHSGEKPFM CKQCB 

KSFSLKANIJJfflQWTHSGBR^FNCKDa^FILKBTTiLFHQKOT 

SGBKPFlCSBCGQGFIWKSNIiVKHOlABBGKQPFVCKECGRGFN 

WKGia^THQRTOSGEKPFVCNVCGCX5FSWKRSLTRHHWRIHSKE 

KPFVCQECIQ^YTSKSPLTVHERIHTGERPYECQECGRKFSNKS 

YYSKHIaKRHLREKRFCTGSVGEASS 


6226 


29 


266 


itcvskixggsgrlpfl^lwrri^cgujprvsp^ 
imbgggqslrvstglswli^lpwp^aqriragrsya 


— ££27"" 


2581 


890 


MSAS S LLKQR P KGGGNKVQNGS VHQKDGLNDDD FE P YJCjS PQ ARP 
MNAYTAMSDSYLPS YYSPS IGFSYSLGBAAWSTGGDTAMP YLTS 
YGQIiS^EPHFI*PDAMFGQPGALGSTPFliGQHGFN?FPSGTDFS 
AWGNNSSQGQSTQSSGYSSMYAYAPSSLGGAMlDGQSAPANBTL 
NKAPGMNTIIK^GmALKI^STEVASNVPKVVGS AVGSGS ITSNI 
VASNSLPPATIAPPKPASWADIASKPAKQQPKLKTKNGIAGSSL 
PPPPIKHNMDIGTWDNKGPVAKAPSQALVQNIGQPTOGSPQPVG 
O^VNNSPPVAOASVGOXn'QPIiPPPPPQPAQLSVG^AAQPTRWV 
APRNRGSGFGHNGYDGNGVGQS0AGSGSTP5EPHPVLEKLRSIN 
NYNPKDFDWNLKHGRVFI I KS YSEDD I HRS I KYNI WCSTBHGNK 
RLDAAYRSMNGKGPVYLLFSWGSGHPCGVAB^ 
VWSQDKWKGRFDVRW I FVKDVPNSQI*RH I RLKNNENKPVTNSRD 
TQEVPLEKAKQVLKI IASYKHTTSIFDDFSHYEKRQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, (^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glyeine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q^Glutaraine, R=Arginine, 
S=Serine, ^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X»Unknown, *=Stop 
Codon, / ^possible nucleotide deletion, 
\=poasible nucleotide insertion) 


6228 


47 


197B 


GRRCKRRGAVMELAQSARKLGCWAVEEMGVPVAARAPESTLRRIi 
CLG(X^IMAYIIiO^VHSQRTVKKIRGNLLWYGHQDSPQVRRKL 
ELEAAVTRLRAEIQKLDQSLEI^ERiyrEAQiyrAMEQARQHTQW 
QRRALLLRAQAGAMRRQQHTLRD PMQRLQNQLRRLQDMERKAKV 
DVTPGSLTS AALGLE PWLRDVRTACTLRAQ FIiQNLLLPQAKRG 
SLPTPHDDHFGTS YOQWIiSSVE/TLLTNE^PGHVLAAIiEHIiAAER 
EABIRSLC^DGIiGDTBISRPQAPDQSDSSQrLPSMVHLIQEGW 
RTVGVLVSQRSTIiIiKEROyLTOIUiQGLVEEVERHVLGSSERQVL 
ILGLRRCCLWTELKAIjnXiSQELQDAAGHRQLLLRELQAKQQRT 
LKITOG^VERTOEQVRLLIKGNSAS 

PTF^AVAPQSRELLRCLEEEVRHbPHILLGTLLRHRPGEI.KPLP 
TVLPSIHQLHPASPRGSSFIALSHKIiGLPPGKASELLLPAAASIi 
RQDIJ^t££DGilSLWCWDLIiHMOT 

QKEfnXQALKRXiEKI*LKQALERIPELQGIVGDWWEQPGQAAI*SE 
ELCQGIiSLPQWRLRWVQAQGALQKLCS 


6229 


1571 " 


560 


GPSLLGTRGTPNPARTLQIFFLI IGRRLTGRMAAVDDLQFEEFG 
NAATSLTAN PDATTVN I EDPGETPKHQPGS PRGSGREBDDEIiLG 
^DSDKTELLAGQKKSS P FWTPE YYQTFFDVDTYQVPDR IKGSL 
LP I PGKNFVRLYIRSHPDLYGPPWICATLVFAIAISGNLSNFLI 
HLGEKTYHYVPEFRKVS IAAT 1 1 YAYAWLVPIiALWG FLMWRNS K 
V^rVSYSFI^WCVYGYSLFlYIPTAir>WIIPHKAVRWILVMI 
ALG 1 SGSIiLAMTFW PAVREDNRRVALATIVTI VLLHMLI.SVG CL 
AYFFDAPEMDHLPTTTATPhKrrVAAAKSS 


6230 


1723 


600 


SKMSGRSGKlOCMSKI^RSJ«U«5VIFPVGRI»MRYLKKGTFKyRIS 
VGAPVYMAAVT EYLAAE I LELAGNAARDNKKAR IAPR HI LLAVA 
NDEELNQLLKGVTIASGGVI^RlHPBUXAKXRGTKGKSETIIiS P 
PPEKRGRKATSGKXGGKKSKAAKPRTSKKSKPKD3DKEGTSNST 
SBDGPGDGFTILSSKSLVLGQKLSLTQSDISHIGSMRVEGIVHP 
'I^AEIDLKEDIGKADEKAGGKEFLETVKELRKSQGPLEVAEAAV 
SQSSGLAAKFVIHCHIPQWGSDKCBEQLEETI KNCLSAAEDKKL 
KSVAFPPFPSGRhTCFPKOTAAG^IJCAXSAHFDDSSASSLKNVY 
FLLFDSRS1GIYVQEMAKLDAK 


6231 


149 


870 


LIFSSSTMDRSIJINVLWSFGFI^FTAYGGLQSLQSSLYSEEG 
LGWALSTLYGGMLLSSM FIiPPLL I ERLGCKGT I ILSMCGYVAF 
SVGNFFASWYTLIPTSII^LGLGAAPTaWSAQCrrYLTITGNTHAEK 
AGKRGKDMVNOYFGIFFLIF^SGVWGNLISSLVFGQTPSQETL 
PEEQLTS CGASIXXMATTTTNS'PQR PSCX^VYTLLGIYTGSGVli 
AVLM IAAFLQP I RDVQRESE 


6232 


3679 


1476 


PVAGTTMAG FWVGTAPLVAAGRRGRWPPQQLMI*S AAIJRTLKliVL 
YYSROCLMVSRNLGSVGYDPNEKTFDKILVANRGEIACRVIRTC 
KKMGI KTVAlHSDVDA6SVK^n<MADBAVCVGPAPT6KSYLNMDA 
IMEAI KKTRAQAVHP^YGPI»SENKBFARCliAABDWFIGPDTHA 
IQAMGDKIBSKLIJu^KAEVOTIPGPDGVVKnABEAW 
PVMlKASAGGGGKGMRIAWDDEBTRDGFRIiSSQEAASSFGDDRIi 
JjXrvKr J.UJN irKrilBlUVJUUUKHuNAljWJaN IUKRNQKWEE 
APS I F LD AETRRAMG BQA VALARAVKYSSAGTVE FL VDS KKNFY 
FI^Mrn-RI^VBHPVTRCITGIiDLVQEMIRVAKGYPLRHKQADIR 
I NG WAVECRVYAEDP YKS FGLPS IGRLSQYQEPLHLPGVRVDSG 
IQPGSDIS I YYDPMISKLIT YGSDRTBALKRMADALDNYVIRGV 
THNIALLREVI INSRFVKGDISTKFLSDVYPIXJFKGHMLTKSEK 
NQMAIASSLFVAFQIiRAQHFQENSRMPVIKPDIANWELSVKLH 
DKVHTVVASNNGSVFSVBVDGSKLIOTSTWN^ 
QRTVQCliSREAGGlMSIQFIiGXWKVNILTRIiAAELNKFMLE^ 
T£17rSSVWtePMPGW\toVSVKPGDAV^ 
MTAGKTGTVKSVHCQAGDTVGEGDLLVELB 


6233 


1 


2654 


HS TRE NLNAGNFNF PS EGHLVRSTG PGGS F AKHMVAQCVS P KG P 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide " 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aapartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Olycine, 
H-Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P^Proline, Q-Glutaraine, R^Arginine, 
S=Serine, T^Threonine , VaValine, 
N-Trypfcophan, Y=Tyrosine, X- Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=»possible nucleotide insertion) 








liACSRT YFFGATHVP YLGGDS KLPKKTEO I RLLS Q I YAAVI EAV 
IAGIACYAKTSSI.TKAKEVAKQTI£SGIJ>SFELIPFKAALRSKM 
TFHIHAV3^QGRIWriDSBDSI>SFVKTACMAVyDI PDLLGGNGC 
IiGSVWSKSFLTSQtLVKKKDGTVTTETSSVVLTAAVPRFCSWI. 
VEDNEVKLS EKTHQAVRGDES FLGTYLTGGEGAYLYSSNLQSWP 
KEGNVHFFS SGLLFSHCRHGSX I ISKDHMNSISFYDGDSTSTVA 
ALLIDFKS S LLPHLPVHFHGSSNFLMIALFPKS Kl YQAFYS EVF 

KKRSSLIO^SAKLPEIiDWFLQHFAISSISQBPVMRTHLPVLUX} 
AE INTTHRI ESDKVI I S I VTGLPGCHAS EliCAFLVTLHKBCGRW 
MVYRQIMDSSECFHAAHFX}RYLSSAL^QQNRSARQSAYIRKKT 
RLLVVI^YTDVIDWQALQTHPDSNVKASFTIGAITACVBP^ 
CYMEHRFLFPKCIJ>QCSO^LVSNVV^ 

IRAANPAAAFILAKNGIVTRNEDIELILSENSFSSPEMLRSRYT, 
MYTGWYEGKLNAG3VYPI>WQICVWFGRPLEKTO 
SIKPSPFSGNirail^KVKFSDSEPJTMEVCYNTI^SLSIMP^ 
EGPTPPPDSKSVSQDSSGQQECYLVFIGCSLKKDS IKDWLRQSA 
KXJKPORKAIiKTRGMr/lXKiEIRS IHVKKHTjEPLPAGYPYi^QFV 
a r ruU JR. 1 WIT li f JUMDU JrWCULi x v cJSiVitcts j. kjv x nucufiSM'' * nuxjr 
BLKP 


6234 


1731 


404 


PRVREI^HKSPGNKGSLVYAGIKSIVKSSIjGMVESSRHNWSGI* 
DKQSDIQNLNEERIIAI^LOJWI KKGTDVDVGPFLNSLVC^KW 
ERAAAVALFTILDIRRAIQILNEGASSEKGDIjNI^^ 

KVAVRJ3RYAFACKFI^I>TQLNRYIBKLTNEMKEAGNLKGI lltg 

LTKDGVDIME S YVDRTGDVQTAS YCMT .QG S PLDVLKDERVQYW I 

EirrRNGLLDAWRPWIKRAEFDIHRSKLDPS 

KS ISYSCSAVPHQGRGFSQYGVSGSPTKSKVTSCPGCRKPLPRC 

ALCLIHMGTPVGSCPGGTKSDBKVDLSKDKKLAQFNNWF^ 

CRHGGHAGHMLSW F RDHAE CP V3ACTCK(>IQLJ7rrGNIjVPAETV 

QP 


6235 


1 


571 


BKRDHRLPSWPRAALKVPGRGGRVGTTPEIJVAGGIMATRNPPPQ 
DYESDDDS YEVLDLTEYARRHQWWNRVFGHS SGPMVB KYS VATQ 
IVMGGVTGWCAGFLFQKVGKLAATAVGGGPIiLLQ^ 
DWXRVEXDVNKAKRQimU^NKAAPEINNLIEEATBPIKQNIVI 
SSGFVGGFJYU5LAS 


6236 


1 


703 


WDQNKGAAAGSGLTI»PSLPSARF3AGPPTQRSRPTMSNMEKHLF 

NI*KFAAKEI^R5AKKCDKEBKABKAKIKKM 

NAIRQiaJQAVNFLRMSARVmVAARVQfTAVT^ 

SMDATI^TMNLEKISAIjMDKFBBQFETIJ3VQTQQMEDTMSSTTT 

LTTPONQVDMIJ^QEMADEAGXiDLNMELPO^ 

DBLSGJttiARLRDQV 


6237 


312 


720 


PTAMABEGIAAGGVPTOVNTAliQBVLKTAIilHTC 

LDKRQABIjCVIiASNCDEPMYVKLVE-AIjC^ 

GBWVGLCKI DREGKPRKWGCS CVVVKDYGKKSQAKDVT EEYFK 

CKK 


6238 


2 


4666 


ERVPTQESVIWEIJ^VIIKNPEIVFVAJ^XJKNDAPALVITTQC^ 
CYTDGirc»ENSTMTAAIIQ>LQVRACPFI^ 

YQTlYjKGTDPQVII^SVKSLTLKVSPVIINTMITITSALYTTKE 
TIPEBTASSTAiaWEKKI>TK!rL^ 

KGEM I KMN I DS I FI VLFAGIGHRTVPMIiLAKSR FSGEGKNWS SL 
3mHC£LELEVHYraEMFGVWEPLI^ 
MKKJCAKMAIVRSDPREFJIYlCVPEY^ 
LVMLNMLVKAFTSAATGSSAD^ 

DSFSVIOTPMAKSYVI^GESLSMDYIRTKDNDHFNAMTSLSSK 
LFFILLTPVNHSTADKIPLTKVGRRIiYTVPJlRESGVERSIVCQI 
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SEU 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, C=Cysteine, D=Aspartic Acid, B~ 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=>Isoleucine, K=Lysine, 
Ii=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T -Threonine , V«Valine, 
W=Tryptophan r Y=Tyxosine, X«Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
\= possible nucleotide insertion) 








DTVEGSKKVTIRSPVQIRNHFSVPLSVYBGDTIiLGTASPBNEFW 

I PLGSYRSFI FLKPEDENYQMCEG I DFRE 1 1 KNDGAIiLKKKCRS 

KNPSKBSFLINIVPKKDNLTSI*SVYSBDGWDLPYIMHLWPPILL 

RNLLr^KIAYYXEGTFJWSVFTLSEOH^ICTAQr^ 

LDYLMHDWKSEyHIKPNQQD I S FVS FTCVTEMEK^DIJD XAVIIMT 

YNTGCrTVVAFHSPYWMVmCTGRMI^YKAIX;i 

FSFQPNHFFNNNKVQLfWTDSELSNQFSI 

MDYQVGVTIDLSSFN1TRXVT FTPFYMI KNKSKYHI SVAEEGND 

KWLSLDLBQCIPFWPBYASSKLLIQVERSEDPPKRIYKNKQKNC 

I LI»RLDNEI»GG 1 I ABVNIiAEHS TVI TFI»D YUDGAATFLL INHT K 

NELVQYNQSSLSEIBDSI,PPGKAVFYTWADPVGSRRLKWRCRKS 

HGKVTQKDDMMMPIDI^FJCTIYLVSFFEGLQRIILFTBDPRVFK 

VTYESEKAELAEQEIAVALQ^GISIjVNNYTKQBVAYIGITSSD 

VVWBTKPKKKARWKPMSV KJiTKKI*EREFKBYTESSP3EDKVIQL 

DTjaVPVRLTPTGHNMKILQPHVlAIiROR^^ 

SFRIQIYRIQIQNQIHGAVFPFVFYPVKPPKSVTMDSAPKPETD 

VS rVMRSAGHSQISR I KTlTKVLI QEMDLRLDIXJFIYALTDLMTE 

ABVTKNTBVBLFHKDI EAFKEEYICTASLVDQSQVSIjYEYFHI SP 

I KLHLSVS LSSGRKKAICDS KQNGGL I WliS LNIjLxIiKS I GATLTD 

VQDWFKIAFFELNYQFOTTCDLQSEVJ^^ 

GLDVLGNPFGLIREFSBGVEAFFYBPYQGAIQGPEBPVEGMAIjG 

LKALVGGAVGG LAGAAS KI TCAMAKG VAAMTMDRDYQQKRREAM 

NKQ PAG FRBG ITRGGKGLVSGFVB G ITG I VTKPI KX3AQKGGAAG 

FFKGVGKGLVGAVARPTGGlII»4ASSTFQGIKRATETSEVBSliR 

PPRFFNEIXjVIRPYP^IXJTGNQmiQKIQFYREWIMTHSSSSDD 

DDDDDDBDESDLNH 


6239 


2108 


634 


KPGMAGKGSSGRRPLI*LGLI>VAVATVHLVICPYTKVEESFNLQA 
TKDIXYHWQDLEQYDHLEFPGVVPRTFIJGPVVXAVFSSPAVYVL 
SLLBMSKPYSQLIVRGVIXJLGVIFGLWTI^KF^RHFGAMVATM j 
FCWVTA^PlIIiMFYCTRTLPNVI^PWLLAl^ 

IIiCLGLTVAVDS YFWRQLTW PEGKVLWYNTVLNKS SNWGTSPLL 
WYFYSALPRGLGCSLLFX PT/^VDRRTHAPTVXiAIjGFKALYSIJj 
PHKELR PI I YAFPMLN I TAARGCS YLLNNYKKSWLYKAGSIJjVI 
GHLVVNAAYSATALYVSHFNYPGGVAMQRLHQIjVP PQTDVIJjHI 
DVAAAG^TGVSRFLQVNSAW RYDKREDVQPGTGMLAYTH I LMEAA 
PGIJjALYROTHRVIJVS WGTTGVSLN^^ 
ERLPRPS 


6240 


2202 


1176 


HERGDSLKEPTS IAESSRHPS YRSEPSLBPESFR3PTFGKSFHF 
DPLSSGSRSSSLKSAC^TGF3LG0LQSXRSEGTTSTSYKSLAMQ 
TR^SLSYDSLLTPSDSPDFESVQAGPEPDPPLGYTSPFLSARL 
AQQREAERHPRLVPTGPTHR3PSPWYDNLSRHIVASLQEREKL 
LRQSPPLPGREEBPGLGDSGIQSTPGSGHAPRTSSSSDDSKRSP 
!^KTPIiGRPAVPRFGKPIX5LRGRGVGSPBPGPTAPYlX3RSPlSY3 
SQKAQPGVSBTEEVALQPIiIiTPKDBVQUCTTYSKSNGQPKSLGS 
ASPGPGQPPI»SSPTRGGVKKVSGVGGTTYEISV 


6241 


I 3 


1341 


RNAB E KKRJjSLQREKI IARVS I DNRTRALVQAIiRRTT C P K LC I T 
RVEELTFHLLEFPEGKGVAVXERI I PYLLRLRQI KDETLQAAVR 
EILALIGYVDPVKGRGIRII^IIX^TRGVVALQTLRKLVELTQ 
KPVHQLFDYIC5GVSTGAI IiAFMLGLFHMPIJ5ECEELYRKI/3SDV 
FS0NV1VGTVKMSWSHAFYDSQTWBNILKDR14GSA1MIBTARNP 
TCP KVAAVST IVNRGI TP KAFVFRNYGHFPG INSHYLGG CQYKM 
WQAIRASSAAPGYFAEYALGITOI^QIXXJ 
LWPDVPI^CIVSLGTGRYESDVRNTVTYTSIiKTID^ 
TEEVHIMLDGLLPPDTY^RFNPVMCENI PUOBSRNEKXDQLQLE 
GWCY I BRNEQKMKKVAKI LS QE KTTLQKINDW I KLKTDMYEGLP 
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beginning 
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corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, Cr=Cysteine, D=»Aspartic Acid, E= 
Glutamic Acid, F« Phenylalanine, G=Glycine r 
HaHistidine, I=Isoleucine, K«Lysine, 
L-Leucine, M=Methionine, N=Asparagine , 
P»Proline, Q=Glutamine, R^Arginlne, 
S=Serine, T=*Threonine , V= Valine, - 
W=Tryptophan, Y=Tyrosine, X^Uitknown , *=Stcp 
Codon, /^possible nucleotide deletion, 
^possible nucleotide insertion) 








PFSKI* 3 


6242 


198 


1310 


QHFIiPGABTWSPGAAVCTARRFPGRSIiAAFPR.PAAPRRAVEMGE 
S^EDIDQMPSTLLGBMDLLTQSLGVDTLPPPDPNPPRAEFIIYSV 
GPKDIiNBSWIAIjEIKlDLIJALMADLVADI SEAEQRTIQAQKESLQ 
NQHHSASLQAS IPSGAASLGYGTNVAATGISQYEDDLPPPPADP 
VLDLPLPPPPPEPLSQEKSEAQAKADKIKIiAIiKKI^KBAKVKKLV 
VKVHMNDNSTXSfcMVDERQIJV^^ 

IYPELQIERFPEDHENVVEVLSDWTRDTENKILPTJRKBKKYAVP 
KNPQNFYLDNRGKKES KETNBKMNAKNKES LLEVRL I LQSGRKB 
KDVCS IFKS FASENNGKI 


6243 


1509 


614 . 


RSASRFSGCWSKDSTCtXCPSTCWSRSSASCPRARWPPSSAPAT 
TS RASSRRLACG PQTRAGAETRSTAMI RANSAARDTRRATCRSA 
AGTPSPTTMTCLTDVPTGCAAVBPTARLPAAAWASriTTGCCPA 
MGOJVGAGPAGRKGSEAGGGPSRAHHAHPSPLPRBPRVRTGPPAH 
SPTPGSIDPS PEkSWGSAGVTQES PbLDPVDFIiI»FRTRAVDPLR 
R VFFFPYO^TPFSIOPQPPP<=HAFHPRDPPAGTKROI/ILVPLK 
GP P I IAP T I»SI/rP ItSRWSCYFPRSR XAOGWHT »S 


6244 


2119 


1745 


FBHAYASQFGTFIX5NNESKRCKLKIJaQKT^4SLWSWVNQPSELSK 
FTNPLFEANNLVI WPS VAPQSLPLWEGI FLRWNRSSRYLDEAYE 
EMVNI IEYNKKLOAKVNJLRRQLAKLETEEGMQBSP 


6245 


81 


114B 


LSLRNAKYS FPQELISLFSMTDLNDN I CKRY I KMI TNI V ILSLJ 
ICISIAFWIISMTASTYYGNIiRPISPWRWLFSVVVPVLIVSNGL 
KKKSLDHSGALGGDWGPIIjXXANFSFFTSLLMFPIiSSSKLTKW 
KGBV KiU*LDSBYKBGGQKNWVQVFX3gGAVPT^ ' 
EIPVDFSKQYSASWMCI^LLAAIAC^ 

RLI TTWEKVP VGTNGGVTWGLVSS 1XGGTFVGI AYPLTQL I FV 
NDL0ISAPQWPI IAFGGIiAGLLGSI VDSYLGATMQYTGLDESTG 
MWNS PTNKARHI AGKP ILDNNAVNLFS S VL IALLLPTAAWGFW 
PRG 


6246 


1177 


359 


SUNPWILMDOSLMQISLQLLCVYTANFPNGCSSIiCHSSCGQHPV 

(^THRGAVSNSI^LCILKLASQMPLEirc^ 

HIX:KGVIOJKSNFIK2NFI£I»AIiPRGGNKHI£N 

SGETOQQMILRLDGCLDLLTF^K^KHKSSPLLPLLIFHNVCFS 

PANKPKI LANE KV I TVLAACLKSENQNAQRIGAAAL W7^L IYNYQ 

KAKTAIJCSPSVKKRVDEAYSIAKKTFPNSEANPLN^ 

LVOLLNSS 


6247 


3 


1678 


NSRVVfGPWTEPSAGSI^PMARKQNRNSKEIiGLVPLTDDTSIlAGP 
PGPGRALLECI>HI^SGVPGGRRRKDWS CSIiLVASLAGAFGSSFIj 
YGYNLS VVNAPTPYI KAFYNES WERRHGRPIDPDTLTIXWSVTV" 
SIFAIGGLVGTLrVKMIGKVIXSRKOTljliANhra 

i^agafemlivgrfimgidggvalsvlpmylseispkeirgslg 
qvtai fi cigvftgqllglpeij^kbstwpylfgvi vvpavvql 
lslpflpds prylllbkhnkaravkapqtfi^kahvsqbvee vl 
aesrvqrs iriivsvleiirap yvrwqvvtvi vtmactolcglim 
iwfytns i fgkag i p paki pyvti>stgg i etlaavfsglvi ehl 

GRRPLL IGGFGLMGLFFGTLTI TLTLQDHAPWVPYLS IVGI LAI 
IASFCSGPGGIPFIIjTGEFFQQSQRPAAFI iagtvnwlsnfavg 

llfppiqksrjjtycflvfaticitgalylyfvlpetknrtyaei 
sqafskrnkayppeekidsavtdgkingrp 


6248 


56" 


1773 


VPPPPJIMAAVPPGLEPWNRVRIPK^^ 

AAVH KECHLVlliS LKSQTJjDAETDVLCAVIjYSNHNFJ^GRHKPHL 

ALKQVBQCLKRLKNMNIjEGS iqdlfelfssnenqplttkvcwp 

SQPVVEIiVLHKVI^CKIJ^RIil^ I 
L^VMVGL VSRLWVLYKGVLKRIiI I*L YE PLFGLLQEVARIQPM P 
YFKDFTFPSDIlBFX^PYFEAFKKKMPlAFAAKGIIIKLIJmJ 
LINEQSPRASEETItLGISKKAKOWKINVQNNVDI^ 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B~ 
Glutamic Acid, ^Phenylalanine, G=K3lycine, 
H^Histidine , I^Isoleucine, K=Lysine , 
L= Leucine, M=Methionine, CX=Asparagine, 
P= Proline, Q^Glutaraine, R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, X^Unknown, *-Stop 
Codon, /^possible nucleotide deletion. 
\apossible nucleotide insertion) i 








KBBSSBFDVRAFCNQLKHKATQETSFDFKCSQSRLKTTKYSSQK 

VIGTPHAKSFVQRFREAESFTQJ^SKBIQMAVVWCRSKKLKAQAI 

FTjGNKId^KSNRLKHLEAQGTSLPKKLECIKTSICN^ 

TSKHHLRQRRSQNKPLRRQRKPQRKIK^TLLRBIQQFSQGTRKS 

ATDTSAKWRLSHCTVHRTDLY pnskqllnsgvbmpvi QTKBKM I 

HKNLiRGIHENETDSWTVMQINKNSTSGTIKETDDIDDIFALI^V 


€249 


56 


1773 


VP P PRMMAAVP PGLE P WNRVRI P KAGIOTS AVTVQNPGAALDLCI 
AAVT KECH1,V1LSLXSQTI>DABTDVLCA^^ 
ALKQVBQCliKRLKNMNLBGS IQDLFEI^SSNENQPLTTKVCVVP 
SQ P WELVI»MlCVLGACKIjIiIjRLLD^^ I 
LNLVMVGLVSRLWVL YKGVLKRL I LL YBPLPGLLQEVAR I QPMP 
YFKDFTFPSDITEFLGQPYFKAFKKKMPIAFAAKGINKTiLNK^ 
LINBQSPRASBETIdiGISKJKAKQMKINV^^ 
KEESSEFDVRAFCNQLKHKATQBTS PDFKCSQSRLKTTKYSSQK 
VI GTPHAKS FVQRFJIKABSFTQIfSEBIQMAVVWCRSKKKKAQAT 
FI^KIJUKSNRIjKHLEAQGTSLPKKIiECIKT^ 
TSIOiHLRQRRSQNKTLRR^RKPQRKXOSTIJjRBlQOFSQGTRKS 
ATDTSAKWRiiSHCTVHRTDLYPNS KQLLNSGVSMPVIQTKEKMI 
HBNLRGIHBNET)DSWTVKQINKNSTSGTIKETO 


£250 


232 


1306 


LAALHIMAJbPFRJQJItBKYKDLDEDKLDQNLSETEIii^ 

LDPENALLPAGFRQKNQTSKSTTGPFDRBH^^ 

REDYVPYTGKKKGKIFIPKQKPVQTFTEKKVSIiDPElaBKALTSA 

SDTEI^IJUUIIXSMHNLITNTKFCNIMGSSKGV 

EKI LPVFDKP PNP'rNVEESI^TKKiniAHLVEVNT.NNI KNI PI P 

TLKD FAKALETMTTHVKC FSLAATRS ^PVATAPAEMLKVMKTLK 

SLNVESNF I TGVGI LAL IDALRDNETLAELKIDNQRQQLGTAVE 

LEMAXMLEBNTNILKFGYQFTQQGPRTRAANA 

VBQDHQ 




62 


972 


TPGSGPMSAWAAASLSRAAARCliliARGPGVRAAPPRDPRPSHPE ' 
PRGCGAAPGRTLHFTAAVPAGHNKWSK^TRHIKGPKDVERSRI FS 
KLCIJJIRLAVKEGGPNPEHNSNLANIIiF^ 

raffiKSmTYI^YEGRGPGGS^IEAI^SSHKCOADiRHILNK 
NGGVMAVGARHS FDKKGVXVVEVEDREKKAVNLERALEMA IKAG 
AEDVKETBDBBERWFKFICDASSLHQVRKKLOSIjGLCSVSCAIj 
BFIPNSKVQIJ^PDLEOAAHLIOAIiSNHFIWIKVYmiB 


6252 


27 


1897 


befctwiavrvgemetapkpgkdvp pkkdklqtkrkkprrywee 
btvptt/u^^pgpprnkknrblrpqrpknayilxksrjskkpqv 
pkkprbwknpesqrglsgaqdpfpgpapvpvewqkfcridksr 
klphs kaktrsrlevaeabeeets ikaarselliiaeepgflegk 
dgedtakicxjadiveavd iasaakhfdlnlrqfgpyrlnysrtg 
rhlafggrj^ghvaald wvtkklmcb invme^ 
avaqnrwlhiydnqgiki 1 hcirrcx>rvtrlbflpfhfllatase 
tgfltyiidvsvgkivaalnaragrldvmsqnpynavthlghsng 
tvslws pamkk plakiiichrggvravavdstgtymatsg^ 

gck^kasppsleqpylthri^gpvhglqf^f^ 
smlvpgagepnfix5Lf^pyrsrbcqrqe»evkalij;kvpablic 
ldpratjvbvdvisleqgkkeqierlgydpqakapfqpkpkqkgr 
sstaslvkrkrkvmdebhiu)kvrqslqqqh^ 

ALDRFVR 


6253 


27 


1897 


BEFC^IAVRVGEMBTAPKPGKDWPKKDKLQTKRKKPRRYWEE 
ETVPTTAGASPGPPRNKKNRELRPQRPKHAYILKKSRISKKPQV 
PKKPREWKWPESQRGLSGAQDPFPGPAPVPVBWQKFCRIDKSR 
KLPHS KAKTRSRLEVAEABEEETS IKAARSELLLAEEPGFLEGE 
DGFjyTAKIC^^IVBAVDIASAAKHFDIJILRQFGPYRLNYSRTG 
RHIjAFGGRRGHVAAIjDWVTKRIiMCEINVMEAV 
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ID 
MO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A^Alanine, C«Cy»teine, D=Aspartic Acid, B=* 
Glutamic Acid, F-Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine, 
I^Leucine, M=Methionine, NrAsparagine. 
P=Proline, Q-Glut amine, R=Arginine, 
S=sSerine, T=Tnreonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion) 








AVAQNRWLH I YDNQGI BLHCIRRCDRVTRLEFLPFH FLLATASB 
TGFIiTYLDVS VGKI VAALblARAGRLDVMSQNPYNAV IHLGHSNQ 
TV^WSPAMKEPLAiaLaiRGGVRAVAVDSTGTYMATSGLDHQL 
KIPDI^TYQPI^TRT^PHGAGHIAFSQRGLLVAGMGDVVNIWA 

SM LVPGAGE PNFDGTjESNPYRS RKQRQE WRVKAIJjEKVP AELI C 
IjDPRALABVDVIS IiEQGKKEQIBRLGYDPQAKAPFQPKPKQKGR 
SSIT^VKRKRKVMDBEHRDKVRQSLQQQHHKEAKAKPTGARPS 
ALDRFVR 


6254 


155 


1139 


HAIjGRI^SQELSAAACKCEALRIJIAPGSGRPALA 
GGAPRFPPRGSAAGRTMLIJCEYRIWPLTVDEYKIGQLYMISKH 
SHEQSDRGEGVEWQNEPFEDPHHGNGQFTEKRVYLNS KLPSWA 








KGSNDTI FDNEAKDVEREVCFIDIACDEI PERYYKESEDPKHFK 
SBKTGRGQLREGWRDSHQPlMCSY^VTVKPEVWGTiQTRVEQFV 
HXVVRDIIiLIGHRQAFAWVDEIWDMTMDDV^ 
VCKQHSS PVDDI BSHAQTST 


6255 


1 


1444 


PTRPOQELLVSLATVI FVASQKAI^ VESKAVI KQQLESVSNGWT 
V YR I ARQASRMGNHDMA KK LYQS LLTQ VA3 KH FYFWLNS IiKE FS 
HAEQCLTGLQEENYSSAliSCIAESLKFYlJKGIASLTAASTPLNP 
I^FC^FVKLRIDIjIjQAFSQL I CTCNSLKTS PP PAIATT 1AMTL 
GNDLQRCGRI SNOMKQ SMEEFR3LASR YGDL YQAS FDADSATLR 
NVELQQQSCLLISHAIEAIilLDPESASFQEYGSTGTAHADSEYE 
RRMMSVYNHVLEEVESI^KYTPVSYMHTACXC^IIALLKVPL 
S FQRYFFQKLQSTS IKlJ\l^PSPRNPAEPIAVQNNQ^liAliKVEG 
W0KGSKPGLFRJC1QSVCLNVSSTLQSKSGQDYKI PIDNMTNEM 
EQRVEPHNDYFSTQFI^FAIIXmiNITVESSVXDANGIVWKTG 
PK A ll FVKSIjKDPYSQQIKijQQQQAP^FIjQyPuQ^ IKr 


6256 


1 


"" 1542 


CRGAGAEPAANPRSPRSLVPSLBSTSTSVPPAPGTMATDSWAIiA 

VDEQEAAAESLSNLHLKKSKIKFITmGAWKTNA^ 

EDRAAQSI*imi.IRSira,VI»^ 

TGKTAAFVIJ^l^QVEPANKYPQCLCLSP^ 

GKFYPBI.KIiAYAVRGNKLBR^KISBQIVIGTPGTVLIWCSiaK 

FIDPKKIKVFVLDSAIWMIATQGBQDQSIRIQRMLPRKG^QsLF 

8ATFBDSVWKFAQKVVPDPNVIKLKREEETrJ)TIRQYY 

uBKFQAIOfoYGAlTIAQJ\MIFC^^ 

LI^^EMMVBQRAAVTIKRFREGIQSKVLVTTNVCARG idveqvsvv 
INFDLPVDKDGN PDNETYLHRIGRTGR FGKRGLAVNMVDSKHSM 
MILNR IQEHFNKKI ERLDTDDLDEIE KIAN 


6257 


210 


615 


APIPAMABLIQKJO^EVBKYQQLQKDLSKSMSGRQKLEAQLTE 
NNIVKEELALLIX^NVWKIJiGPV^ 

TAB IKRYESQLRDIiERQSEQQRETriAQLQQB FQKAQAAKAGAPG 
KA 


6258 


210 


615 


AFIPAMAEIjIQKKIjCX5EVEICYQQI^ 

NNTVKEEIJuUJX5SNWFKLI73PVLVKQELGBA^ 

TAE I KPYBSQLPJ3LERG^BO^RKTIAQliQ0B FQRAQAAKAGAPG 

KA 


6259 


2 


1540 


II^EKGFPSQCHPERKWKVDDVLBSSQF^KDDHFWBLLFR^KTV 
SVBITCDRGSKTFNI^DPVSLRNYPYXI CDSCEMNLKNTSGLII 
SKKNCSRKKPDEFNVCBKLLLDlRlffiKIPIGEKSyKYWKR^IAI 
NYHQDLSQPS FGQS FE YSKNGC<3PHDBAAFFTNKRSQIGETVCK 
YNECGRTFIESLKLNT SQRPHLBMEPYGCS ICGKSFCMNLRFGH 
QRALTKDNP YEYNE YGEIFCDNSAFI IHQGAYTRKILREYKVSD 
KTWEKSALLKHQIVHMGGKSYDYNENGSNPSKKSHLTQLRRAHT 
GEKTFECGBCX^FWEKSNIiTQHQRTTTOKKPYECTECGKAFCQ 
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NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P^Phenylalanine, G=Glycine r 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N-Asparagine, 
P~Proline, Q=Glutamine, R=Arginine, 
S=Serine f T»Threonine, V= Valine, 
W=Tryptophan, Y=Tyxosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KPHLTNHQRTHTCE KP Y ECKQCXiKTFCVKSNLTEHQRTlITGEKP 
YE CNACGKS FCHRS ALTVHQRTHTGBKPF I CNE CGKS FCVKSNXj 
rVHQRTHTGEKPYKCWBCX^KTFCEKSALTKIIQRTHTGEKPYECN 
ACGKTFSQRS VLTKHQR IHTRVKAL3TS 


6260 


2081 


1436 


GTGPEIHACAHASARAPGSRAMJfu^REIiKVC^ 
RJTEDSFDPNINPTIGASFMTKTVQYQNELHKFLIWDTAGQERF 
RALAPMYYRGSAA7VIIVYDITKEHTFSTLKNWVKELRQHGPPNI 
WA TAGNKCBL ICVRBVMERDAKD YADS IHAI FVBTSAKNAINI 
NELFIEISRRIPSTnANLPSGGKGFKLRRQPSEPKRSCC 


6261 


3 


1188 


FWYRLGPGTRSRWPRRGSHAASLVPRGPSPAALVTSPCPPDPLR 
SPACEPCMDFAPRPAIiLriRSGPRSAPAVTGKPALKGQPGPWPG 
MAEV3II^SKI*PGVKEVC^U>FAVLEDHTLAHSLQEQE IEHHLAS 
NVORNRLVQHDLQVAXQIiQE^DLKAQAQLQKRYKDLEQQDCEIA 
QBIQEKIAIBAERRRIQEKKDEDIARlJX}EKEIX3EbKKRJCKHFP 
EFPATPJ^YADSYYYEDGGMKPRVMKBAVSTPSPJIAHRDQEWYDA 
E IARKLQERBLLATQVDMRAAQVAQDE E IARLLMAEEKKAYKKA 
KERBKSSLDKRKQDPBW KPKTAKAANS KS KRSPE PHH3KNERP A 
RPPPPIMTbGBDADYTHPTOQQSSTRHFSKSESSHKGFHYKH 


6262 


2 


1759 


PEGHSCGIjCSVHRPGKVPOARI^bVLGQRDEPAGHRI^QKKIL 
GSTRLVSQGLBALRSBHQAVIiQSTtSQTIECLQQGGHEEGLVHBK 
ARQT.RRSMBNIELGIjSEAQVMLAIJVSH^ 

IX!QENQWLPvDELAGTQQRI^RSEQAVAQLEEEKKin^EFLGQLRQ 

YDEIX^SBEKEGDATKDSLDDLFPlfBEEEDPSiroLSRGQGArA 

AQQ^YEIPARLRrLHNLVIQYAAQ^RYEVAWI^^ 

TSGRGHPDVATMIOTIJa.VYlU)QNKYKBAW 

GPDHPAVAATLNNLAVI^YGKRGKYKEAEPLCQRArjEIREKVLiGT 

NHPDVAKQLNNLALLCONOGEGfEAVERYYQRAIiAIYEGQLGPDN 

PNVARTKNNLASCYIiKQGKYAEAETLYKEILTRAHVQEFGSVDD 

DHK P I WMHAEEREEM S KS RHHEGGTP YAEYGGWYKACKVS S PTV 

NTTIJilirE/SALYRRQGKLEAAETLEBC^ 

EUX5ESDGRRTS0EGPGDSVKPEGGEDASVAVEWSGDGSGTLQR 
SGSLGKIRDVLRR 


6263 


1 


2408 


RELDS LADLPBR I KP P YANGLSTSHljRS S S VKD VKLiI ISEGRPT 

IEVRRCSMPSVICEHTKQFQriSBBSNQGSLLTVPGDTSPSPKP 

EVFSNVPERDLSNVSN I HSSFATSPTGASNSKXVSADRNL I KNT 

APVMTVMDSPVBliBPSSQVGVIQNKSHEMP^ 

PNSNIPDQESSLQS FCNSENK^K^KADFr^I^QTELPGMSCAQ 

DPASFMPPQQPCSFPSQSLSDAESISKHMSLSYVANQEPGILQQ 

KNAVQIISSAIOTDNBSTKXJT^^ 

IQEASPNPBKAyTIiPVLPSEKDFWGSDASTQIOTHYAPSKLTYK 
SSSGHEVENSTTDTQVI SHEKBNKIiESLVLTHLSRCDSDIiCEMW 
AGWPKX^tJiBQDPKHCPESEKCTiTjSTRDRRSQQS ILSSLKMHSQ 
QSTQPEMHKYGQLVKVELEENABDDKTENQI PQRMTRNKANTMA 
NQSKQILASCTIJLSBKDSESSSPRGRIRLTBDDDPQIHHPRKRX 

Vt>KV FU*' Vy VSPSIdjQfUUSKlV^JliAAIVUS^^ IQP YSSER 

ANP Y FBYLH IPJCKI EEKRKLLCSVI PQAPQVWEYVTFNGS YLL 
IXMPLSKICIPTITPPPSLSDPUCEIJp^QQEVVRMICLRX^ 
RBKL I VSNEQEVIjRVirYRAARTLANQTLP ACTVLLDAEVYNV 
PU)SQSDDSKTSVRDRFHARQFMSWLQDTO^ 
QHEAAALNAVQRLEWQLKLQELDPATYKSI S IYEIQEFYVPLVD 
VNDDFELTPI 


6264 


143 


I960 


KHRQEHMALDMAPE IRMTGPMCL IENTNGEL VANPEALKI LS A I 
TQFVVWArVGLYRTGKiS YLMNKIAGKNKGFSI^ TVKSHTKG I 
HMWCVPlD?Kia?EHTLVIAmTOLGDVKKG^ 
SSTLVYNSMGTINQOAMDQLYYVTEIiTHRIRSKSS PDENENEDS 
ADFVSFFPDFVWTLRDFSLDLEADGQPLTPDKYLEYSLKLTOGT 
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beginning 
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corresponding 
to first 
anu.no acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, c=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H^Histidine, I=*Isoleucine, K=I»ysine, 
L-Leucine, M-Methionine, N»Asparagine, 
P=»Proline, Q=Glutamine, R=Arginine, 
o -oerine, jtaxnreoaiim, v—vajLUie, 
W-Tryptophan, Y=Tyroaine, X=u*nknown f *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








SQKDKHRn»PRLCIRKFFPKKKCF\n^ 

EI^PEFVQQVADPCSYIPSNSKTKTI^GGIKVNGPlUiESLVLTY 
ItJAJSRGDLPCMET^VLAIiAQlENSAA 

LPAETLQKLLDLHRVSEREATBVYMKNS FKDVDHLFQKRLAAQL 
DKKRDDFCKQNQEAS SDRCSALLQVI FSPLEEBVKAGIYSKPGG 
YCXFIQKIjQDI^KKYYEEPRKGIQAEBIIj<yrYTjKSlCESVTDAIL 
QTIXJI LTEKEKEIEVECVKAESAQASAKMVEEKQI KYQC^'IMEE K 
BKSYQBh^XQI>TBKMERERAQl,I^QBKTI>TSKIiQEQAKVI>KBR 
CQGESTQLQNE IQKLQKTLKKKTKH YMSHKLKI 


6265 


143 


1960 


KKRQENRW.DMAPEIHMTGPMC^IENTNGELVAHPEALKILSAI 
TQPWVVAIVGLYRTCKSYIMNKLAGKI^ 
VWWCVPHPJCKPEHTI*VI»LDTEGLGDVKKGDNQNr>SWI FTLAVLL 
SSTLVYNSMGTINQQAMDQI>YYVTBLTHK r RSKSSPDENENBDS 
ADFVSFFPDFVWTLRI)FSrj>LEADGQPLTPDEYIiBYSLKLTQGT 
SQKDKNFNLPRIjCIRKFPPKKKCFVPDIjP ihrrxlaqleklqde 
E1.DPEFVQQVADFCS YI FSNSKTKTLSGG I KVNOPRLES LVLTY 
INAXSRGDLPOMENAVIJU^AQIB^ 

LPAETLQELLDLHRVSEREATBVYMKNS PKDVDHLFQKKLAAQL 
DKKRDDFCKQNQBAS SDRCSALLQVI PS PLEEEVKAGI YSKPGG 
Y f 'T »FI QKLQDLEKKY YE E PR KG I QAEE I TiQTYLKS KK S VTDAIL 
QTDQI LTEKEKE IBVECVKAES AQASAKMVE^JQIKYQQMMEE K 
EKSYQEHVXQLTEKMBRERAQLLEEQRKTLTSKLQEQARVLKER 
CXX?ESTQliQNEIQKLQKTLKKKTKRYMSHKLO 


G266 


276 


1421 


GSHQKSQMLVPCFLYSI^NPJCPSLYG 

GFTVNEINKKS IHI6 CPKENASSKFLAPYTTFSRIHTKS 1TCLD 
I SSRGGLGVSSSTDGTMKJWQASNGELPJIVLEC3HVPDVMCCRFF 
PSGLWLSGGMnAQLKI WSAEDAS CWTFKGHKGGI LDTAI VDR 
GRNWSASRDGTAFXWDCGRSACXGVLADC^ 
S IHl&SPEQMPSEKBVGTEAKMLiiliftj^ 
LFIGSDAFNCCTFIiSGFTiliIiAGTQ^ 

RSGA PVX»SLLSVRDGFIASQGDGS CTIVQQDLDYVTELTGADCD 
PVYKVATWEKQIYTCCRDGLVRRYQLSDL 




3 


622 


IX}MMKKNNSAKRGPQDGNQQPAPPEKVGWVRKFCGKGIFREIWK 
NRYVVIJ<Gl)QLYI3EKEVKDEKiaQEVFDI^DYEKCEELRKSKS 
R S KKNHS KFTLAHS KQ PG NTAPN h I FL»A VS P EE KSS W I NALNS A 
ITRAKNRILDEVTVEEDSYIJ«1PTPJ3PJVKI0HSRRPPTRGHLWA 
VAS TSTSDGMLTLDL IQEEDPSPE BPTSLC 


626B 


160 


1368 


KRRLCQNLPAGLSSALlbNPLTlJjLSI 

DFSREEWGLl^PTQRTEYRDVMLETroHLVSVGWBTTLBNKBLA 
PNSDIPEEBPAPSLKVQBSSPJDCAI^STLEDTI^GGVQBVQDTV 
LKQMBSAQEKDLPQKKHFDNRBSQANSGAU>TNQVSLQKIDNPE 
SQANSOAIiDTNQVLLHKI PPRKPJ^PJO^DSQVKSMKHNSRVKIHQ 
KSCERQKAKEGNGCRKT F3RSTKQ IT FIRIHKGSQVCRCSECGK 
I FRNPRYFSVHKKIHTGBRP YVCQDCX5KGFVQSSSLTQHQRVHS 
GBRPFBCQECGRTFNDRSAISQHIJIT^^ 
SSHLIRHQRTK1X3ERPYACNKCGKAPT0^SHLIGHQRTMTOTKR 
KKKQPTS 


6269 


2886 


1449 


KASAPTRRNMAAASPI^DCHAWIOJARLPIiSTT 

TQYVKWTNDKSI/^IEGCIjSKLKAADPTFVMGHaMATGL 

GSS VKLDKELDLAVKTMVE I SRTQPLTRREQLH VS AVETFANGN 

FPKACEI^EQILQDHPTDMUu^KFSHDAYFYLGYQEQMRDSVAR 

I YP FWTPDIPLSSYVKGI YS FT3LMETNFYDQAE KIA3CEALS INP 

TDAWSVHTVAHIHEMKAB I KDGLBFMQHSBTLWKDSIMLACHNY 

WHWALYLI EKGE YEAALT I YDTHI LP SLOANDAMLDVV0SCSML 

YRLQMEGVSVGQRWQDVLPVARKHSRDHILLFNDAHFLWASLGA 

HDPffTTQBIjyrTIiRBA^^ 
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sequence 


Amino acid segment containing signal peptide 
(A<*Alanine, C=Cysteine, D=»Aspartic Acid, E= j 
Glutamic Acid, F=Phenylalanine, G*Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
I^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
SsSerine, T=Threonine, V^Valine, 
W^Tryptophau, Y^Tyroeine, X«unknown, *=Stop 
Codon, /=«possible nucleotide deletion, 
\«possible nucleotide insertion) 








DGNPDR VLELLLP I RYRJ VQUjGSNAQRD VFNQIit, I HAALNCTS 
SVHKNVARSLI^RDALKPNSPLTKRLIRKAATVHLMQ 


6270 


23 


2086 


SVTVTLGSEGPGRPPTYHLBBMEQKPQNGEPAEIKIIREAYKKA 
PLPWKGI^TDET>GQKEEAKNYYKQGIGHLLRGISISSKESEHT 
GPGWBSARQMQOKMKETLQISrVRTRtB ILEKGLATS LQNDLQBVP 
KLYPEFPPKDMCBKLPEPQSFSSAPQHAEVNGNTSTPSAGAVAA 
PASLS bPSQSCPAEAPPAYTPQAAEGHYTVSYGTDSGEFSSVGE 
EFYRNHSQP PPLBTLGLDADBL I L»I PNGVQ I FFVNPAGEVS APS 
YPGYIJttVRFLI)NSLm*VIMPPGFI>Q^ 
CTAGAYM FPDTMLQAAGC3?VGVV^ 

DLRI^QANWNRAKEENE FQIPGRTRPSSDQLKEASGTDVKQLDQG 
NK15VRHKGKRGKRAKDTSSEEVNLSHIVPCEPVPEEKPKBLPEW 
SBKVAHNILSGASWVSWGI,VKGAEITGKAIQKGASKLRERIQPB 
EKPVBVSPAVTKGt*VIAKQATXMAAKVSQFLVIX^CTVANCVGK 
ELAPHVKKHGSKLVPES LKKDKDGKSPLDGAMVVAAS S VQG FST 
VWQGLEC3UlKCIVNNVSABTVO^I^Y 

KVGVTAYN 1NNIGI KAMVKKTATQTGHTLLEDYQI VDNSQRENQ 
BGAANVNVRGEKDEQTKEVKEAKKKDK 


6271 


32 


1058 


GCGVKTAGMVGREKitbS I HFVPG S CRLVEEEVNI PNRRVXiVTGA 

TGIiGRAVfflCBFtXTNNWHAVGC^ 

I IITOP^PHVIVHCAAERRPDWENQPDAASQUn^^ 

AAVGAFLI Y I SSDYV51)GTI^P YREEDI PAPLNLYGKTKXiDGEK 

AVLKNNLGAAVLRI PI IiYGEVB K LEES AVTVWFDKVQ FSNKSAN 

MDHWQQRFPTHVKDVATVCRQLAEKRMIiDPSIKGTPHWSGNEQM 

TKYEMACAXADAFNLPS SHLRPI TDS P VLGAQRPRNAQ LDCS Kl> 

ETLGrGQRTPPRtGlKESLWPFLIDKRWRQTVFH 


6272 




528 


GAVMEDAAAPGRTEGVLERCX^PAAGQGGALVELTPTPGGUUj 

VSPYHTHRAGDPLDLVAlJ^QVQKADEFIRRNATim.TVIAEQI 

QHUJEQARKVLEJQAHRDANIiHHVAC^IVKKPGWIYY^^ 

QYPS I ISPKEWGTSCPHDFLGAYKIK}HDLSWTP YEDIEKQDAKI 

SMMDTLLSQSVALPPCTBPNFQGLTH 


6273 


256 


843 


SCPRVSPECRSLG^VMFSLPl^C^PDHrRRGSCWGRPQDLKIA 
SAAWNS KCHPGAGAAMARQHAKl liWXDRPR xv FMEFCV KD3TVV 
HVLI BDHR I VPS CKNADG VELYNB I E FYAKVNS KDS QDKRS S RS 
ITCFVRKWKBKVAW PRLTKKDIKPVWLSVDFDNWRDVfEGDKEME 
LAHVEHYAEVRDKTYCVLPT 


6274 


56 


1142 


AAAAMAAAAGGGAGAARSLSRFRGCIAGAIJ^IX^GSFYEAHOT 

vt/i-»l £> VijrtH VyoJuOl Ursa! ruDCKJtb/Uil X llJViAnHK*U>VyoiJ 

LAKEAFDEVDMAHRPAQBYKKDPDRGYGAGVV^ 

DVFE PARAQ FNGKGS YGNGGAMRVAG I SLAYS SVQDVQKFARLS 

AQLTmSSTvGYNGArU^I^VHIiMXJGKSSSKHFLKQI^GHMED 

LBGDAQSVI»EARELGMBERPYSSRLKKIG^ 

EDGNGIAAFESVPTAIYCFLRCKBPDPKIPSAFNSLQRTLIYSI 

SWXSDTDTIATMAGAlTyGAYYGMDOV^ 

QSLHRVFQKS 


6275 


20 


""' 565 


SRRGRARCJ^GSRRPVPRPAKTMAPWVKTWGGQLKNLTGSLG 

GGEDKGDGDKSAAEAQGMSREEYEEYQKQLVEEKMBRDAQFTQR 

KAERATIJlSHFRJDKraLPKNETDESQIQMAGGDVEIjPRBlJ^I 

BElYIBEEEEKASVLG^LASLPGtiNI^ 

KCHVN 


6276 


797 


97 


TLI>PI»PPt>PlTrBGMII»Ltnral»EGTVAE^^ 
SClJQQIATHPGHWGIHI#IAEPAAL^ 

VWVGAKISHGSFSVPGHVAGREI^TAVAEVFPHVTVAPGWPBEV 
I^SGYllEQIXTDMI^LCC^LWQPVSFQMQAMLLGHSTAGAIGRL 
IASSPRATVTVEHNPAGGDYASV^TAIiTAARAVDRTRVYYRLPQ 
GYHKDLLAHVGRIf 
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SEQ 
ID 

NO: 


predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C-Cysteine, D=Aapartic Acid, E«= 
Glutamic Acid, F=*Phenyl alanine, G=Glycine, 
H»Histidine, I=Isoleucine, X=Lysine, 
ImLeucine, M=Methionine, N-Asparagine, 
P^Proline, Q«Gl ut amine , R-Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X«Unknown, **Stop 
Codon, /=possible nucleotide deletion, 
\=poseil>le nucleotide insertion) 


6277 


4600 


2744 


MAFRTE MGLY YS YFKT I VEAPSFLNG VWMIMNDKLTEY PL VI; NT 
IdKRFNEiYPBVI LASWYRI YTKIMDLIGIQTKI CWTVTIGEGI»SP 
TBSC^LGDPACFY^VIFlI^LmAI^lYGTYI^SRI^GL 
VTVLCFFFmJGECTRVMVn'PP^SFSYPFLVI^I^VTHILRA 
TKLYRGSLIALCISNVFFMLPWQFAQFVIXTQIASLFAVYVVQY 
IDICKLRXllYIHMISI^CFVI^FGNSMbLTSYYASSLVIIWG 
IIJiMKPHFLKINVSEXSLWVIQGCFWIiFGTVILKYLTSKIFGlA 
NDAHIGNLLTS K?FSYKDFDTLLYTCAAKFDFMKK3TPLRYTKX 
liLLPWLVGFVAIVRKI I SDJWGVLAKQQTHVRKHQFDHGKLVY 
HALQIJ^YTAIX5ILlMRLKL?LTPHMCVMASLtCSRQLFGWLFC 
KVHPGAIVFAIIiflAMSICX?SANrOTQWIVGKFSNXiPQEBI.IEW 
JKYSTKPDAVF^AMPTMASVKLSALRPIVHHPHYBDJ^LRART 
KTVYSMYSRKAAEEVKRELIKLKVNYYII^ESWCVRRSKPGCSM 
PEIWDVBDPANAGKTPLCNI^VKDSKPHPTTVFQNSVYKVLEVV 
KE 


6278 


3 


823 


ILFPJbVL^LVYIiLNSVATEKRJO'ABVLIVEGQQYAVVGTVLLL 
IRIII*E YCQGVDNIPSVTTDMLTIUjSDLI»KYFNSRS CQLVI/5AG 
ALQVVGI^riTKlftjALSSRCI jQI>IVHY I PVTRAHFEARLPPKQ 
YSMIiRHFDHITKDYRaiHIAE ISAKLVAIMDSLPDKLLSKYBVXA 
PVP^CFRI^CICQMTKMHHAIFDLLPEKQTQMLFIJR.INASYK^ 
LKKQLSHLNV1 NDGG PQ!TCI»VTADVAFYTGNLQALKGLKDIJ)LN 
MABIWKQKR 


6279 


127 


1687 


GGAMASDGARKQ FW KRSNS KL PGS IQHV YGAQHPP FDPIxLHGTL 

LRSTAKMPTTPVKAIOlVSTFQEFESNTSDAtroAGm)DDE^AMA 

AKSI^SEVVMET7^VLRNHSQRQGRPTLQBGPGLCK?KPRPEAE 

PPSPPSGDIiRLVKSVSBSHTSCPAESASDAAPlJDRSQSIJPHSAT 

VTLGGTSDPS'ri^SSALSF-REASRliDKFKQLrAGPNTDLKELRR 

LSWSGIPKPVRPMTWKLLSGYLPANVDRRPATLQRKQK2YFAFI 

EHYYDSKHDBVHQDTYRQIHIDIPRMSPEALILQPKVTEIFBRI 

I^IWAIRHPASGYVQGI^LVTPPPWFICEYIET^EVDTVDVS 

GVPAEVI^IEADTYWCMSFOiLDGIC^^ 

BLVSRII>EQVHRKIiI>QHBVRYI>QFAFRW^^ 

LW>TY0SKPDGFSHFHLYVC7UVFLVRWRJCBILEEKDFQELIJ^FL 

QNLPTAHWDDEDISLLLAEAYRIiKFAPADAPNHYKX 


6280 


857 


2515 


KCX3DOKMGSRNSSSAGSGSGDPSEGLPRRGAGLRRSEBEEKEDK 

DVDLAQVXAYIJ^GQVRLVQGGGAAN^ 

WDGMX3DRYNPPVDATPDTRELEFl!iEIKTQVBIA 

QKHSFPRMLHQRKRGLCHRG3F3LGEQSRVI SHFLPNDLGFTDS 

YSQKAFCGI YSKDGQIFMS AOQDQTI RLYDCRYGRFRKFKS IKA 

RDVGWSVLDVAFTPDGNH FLYSS WSD YIH I CN I YG EGDTHTALD 

I&PDERRFAVFSIAVSSDGRF^GGANIXSCLYVF^^ 

IESHBDDVNAVAFADISSQILFSGGDDAICKVWDRRTMREDDPK 

PVGAIAGHQDG I TF I DS KGDAR YLI SNSKDQTIKLWD IRRFSSR 

BGMEASROJUVTQQNVn3YRWQ^VPK3(A^ 

GVLHTT,IRCRFSPIHSTGO^FIYS(K^TGiCVVVYDU^ 

LTNlIKACVRDVSWHPFBEKIVSSSWDGNIiRLWQYRQAEYFQDDM 

PESEECASAPAPVPQSSTPFSSPQ 


6281 


857 


2515 


ECCDQKM6SRNSSSAGSGSGDPSEGLPRRGAGLRRSEEBEEED& 
DVDIAQVLAYLLRRGQVRI.VQGGGAANLQFIQALLDSKKENDPA 
WDGRLGDRYNP PVDATPDTREI »EFNE I KTQVELATG QI/5LRRAA 
QKHSFPRMLHQRERGLCHRGSFSLGEQSRVISHFLPliDLGFTDS 
YSQKAFCG I YS KDGQI FNSACQDQTIRL YDCRYGRFRKFKS I KA 
RDVGWSVLDVAFTFDGNHFbYSSWSBYIHlCNIYGEGDTirrALD 
LRPDERRFAVFS lAVSSIX^RF^LGGANDGCLYVFDREQNRRTLQ 
I ESHEDDVNAVAFADXSS Q I L FSGGDDAI CKVWDRRTMRKDD PK 
PVGAIAGHQDGITFIDSKGDARYX1SUSKDQITIKLWDIRRFSSR 
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SKQ 

JLJJ 

NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 

LtiSXQUc OX 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanirie, C—Cysteine, D=Aspairfcic Acid, £b 
Glutamic Acid, F=Phenylalaniiie, G=Glycine, 
H-Histidine, I^Isoleucine, K=Lysine, 
I»-Leucine, M=Methionine, N^Asparagine r 
P=Proline, Q=Glut amine, R=«Arginine , 
S=Serine, T=Threonine, V=>Valine t 
W=;Trvnt'rtnhan YnTvrafORTiie* . X=Un known *s*Sfcon 
Codon, /^possible nucleotide deletion, 
\=posaible nucleotide insertion) 








EGMEASRQAATQQNWDYRWCWVPKKAWRKLKLPGnSSLMTyRGH 
GVLHTLIRCRFSP IHSTGQQFI YSGCSTGKVWYDliLSGHI VKK 
LTNIIKACVRDVSWHPFEE KIVSSS WDO^LRLWQ YRQASYFQDDM 
PESEECASAPAPVPQSSTPFSSPQ 


6282 


125 


nr\f 

906 


HKAAUKAliKAV LiVlJJji5ti , i'JjJttJ.KLVVAV iAiAyBAIjKJtJjKtiAv? V X IK 
FVTNTTKBS KQDLLERLRKLBFDI5EDEI FTSLiTAARS LLERKQ 
VRPI^VDDRALPDFKGIQTSDPNAVVMGrAPEHFHYQ I LNC3AF 
RLLLDGAPL IA1 KKAR YYKRKDGLAEG PFVTALEYATDTKAT 
VVGKPKKTrFFLEAlJ^TGCKPEEAVMIGDDCRDDVGGAQDVGML 
GILVKTGKYRASDEEKINPPPYjbTCESFPHAVDHILQHLL 


6283 


140 


104* 


LSIiFGIHVMNPFWSMSTSSVRKRSBGEBKTLTGDVKTSPPRTAP 
KKQLPSIPKNALPITKPTSPAPAAQSTNGTHASYGPFYLEYSLL 
ABFTLVVKQKLPG VYVQPS YRSALMWFGVI PIRHGLYQDGVFKF 
TVYIPD^PDGDCPRLVPDIPVFHPLVDPTSGEriDVKRAFAKWR 
PJNHNia^VLMYARRVFYKIDTASPI^PHAAVLYKKDIQLFKSK 
WDSVKVCTARLFDQPKIEDPYAIS FSPWNPSVHDEAREKMLTQ 
KKKPEBQHNKSVHVAGLSWVKPGSVQPFSKEKKTVAT 


62B4 


1 


287^ 


RSVIPGSTISSRNPGLSRPRFMAAHEWDWFQREEIjIGQISDrRV 

C^I^VERF3WQKRTrTRWINIiHTiRKCNPPLEVKDLFVDIODGKI 

imLLEVl^GRNrJJIKYKSSSIIRiraLNNlAKM^KFr.EDSNVKr, 

VSIDAAEIADGNPSLVIiGLrWNIILFFQIKEliTGNIiSRNS MSSS 

LAIH3SGGTDSDSS FPPTPTAERSVAI SVKDQRKAIKAELAWVQR 

KTRKYGVAVQDFAGSWRSGIAFIAVIKA1DPSLVWKQAIJBNST 

REKLEKAFSIAQDATJIIPRliEPEDIMVDTPDEQSIMTYVAQFL 

ERFPEIiEAEDI PDSDKEVP I EST FVR I KETPSBQESKVFVLTEN 

GERTYTVNHETSHPPPSKVFVCDKPESMKEFRIjDGVSSHAIiSDS 

STEFMHQIIDQVLQGGPGICrSDISEPSPESSILSSRKEHGRSNS 

LPIKKTVHFP7U)TYKDPFCSKNI»SbCFEGSPRVAKESLRQDGHV 

IAVKVAEEKEQXQESSKIPESSSDKVAGDIFIiVEGTNNNSQSSS 

a^GALBSTARKO)EFJSHSLSPPGKNTVMADSFX}IKVNI^ 

BGDYFEAI PLKASKFNSDL IDFASTSQAFNKVPS PHETKPDEDA 

KAFBNHABKLGKRSIKSAHKKXDSPEPQVKMDKHEPHQBSGEEA 

EGCPSAPEETPVDKKPEVHEKAKRXSTRPHYEEE^EDDDLQGVG 

YPPHYEVPIAAVLEAYVEBPEDbKNBEMDIiEBPEGyMPDI^RE 
EEADGSQ3SS S SSVPGBSLPS ASDQVLYLSRGGVGTTPASB PAP 
LAPHBDHQQRB TKENDPMDSHQSOES PNIiRNI AN PLEKNVTKKS 
I SSKKXEmKHVDHVESSE»FVAPGSVQSSDDIJgEDSSDYSI PSR 
TSHSDSS IYLRRHTHRSSRSDHFSLCSVEERSRSG 


6285 


2157 


1331 


S CKTBNLEEMWWFQQGEE FLPSALVI WTSAAFI FS Y I TAVTLHJ I 
IDPALPYI SOTGTVAPBKCIiFGAMLNIAAVLCIATIYVRYKQVIl 
ALSPEENVI IKLNKAGLVLGILSCLGLS I VANFQKTTLFAAHVS 
GAVLTFGMGSLYMFVQTII^SYQMQPKIHGKQVFWlRLLLVIWCG 
VSALSMETCS SVLHSGNFGTDIiEQKLHVfNP EDKGYVLHMITT AA 
EWSMSFSFFGFFLTYIRDFQKISnRVBANI^GLTLYDTAPCPIN 
NERTRLLSRDI 


6286 


1619 


27* 


KAGASCCGSLANPYVSVGKSCVLLA^^ 

PFSIPAASEIADLSNI INKLLKDKNEFHKHVEFDFLIKGQFLRM 
PLDKHMEMEia SS BE VVB IE YVElCyTAPQPEQCMFIIDDWISS IK 
GAEBW I LTGSYDKTSR IWSLBGKS IMTI VGHTDVVKDVAW VKKD 
SLSCXLIiSASMDQTII^EVmBRNKVKALHCCRGRAGSVDS IA 
VDGSGTKFCSGSWDKMLKIWSTVPTDEEDKMEESTI'JRPRKKQKT 
EQI^LTRTPIVTI*SGHMEAVSSVI>HSDAEBICSAS^HTrRVWD 
VESGSLKSTLTGNKVFNCISYSPLCKKIASGSTDRHIRLWPPRT 
KDGSLVSLSIiTSHTGWVTSVKWS PTHEQQLI SGSLDNI VKLWDT 
RSCKAPLYDIAftHEDKVLSVDWTDTC 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
Corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=.Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, t=Isoleucine, K=l>ysine, 
L=Leucine, M-Methionine, N^Asparagine, 
P=Proline, Q=Glut amine, R^Arginine, 
S^Serine, T=Threonine, V=Valine, 
H=Tryptophan r Y=Tyrosine f X ^Unknown, *sStop 
Codon, /^possible nucleotide deletion, 
\~possible nucleotide insertion) 








TTSHVGA 


6207 


* 


1482 


MQi'PFNFQIGI.RSTSGKEKYSGDAG^EGDALQLFLQCLALDBDF " 

APAIOjQVQKIl^IJ^PEm.KEGIJCBSSWSSLPCrKNRPPDP^ 

VMEESOSLNEPS PKQSEEIPE VTSBPVKGSLNRAQS AQS INSTE 

MPAREIX^KRVSSK PVLSVQEKGVIJjKRKLSIJ^QDVJ^NEDGR 

NKLKKQGBTPNEVCMFSLAYGDIPBBLIDVSDFECSLCMRLFFB 

PVTTPOGKSFCmCIiKRCIiDHAPyCPLCKESLKEYLADRRYCW 

OLl^EIilVTaijPDELSERKKIYDEBTAELSHLTKNVPIPVCTMA 

YPWPCPLHVFKPRYRI^IRRSIQTGTKQFGMCVSDTQ^SFADY 

GCMLQIRKVHPIjPIjGRSVVDT^ 

YLEDV 


6288 


1 


743 


VTLYPCRGLVGNLLIXJASGMASGCKIGPSILNSPLANU5AECLR 

MLDSGADYI»HTjDVMIX3HFVPNITFGHPVVESI^^ 

HMMVSKPEQWKPMAVAGAWYTPHLBATEHPGALI KDIRENGM 

KVGLAlKPGTSVEYIiAPWANQIDMALVM^ 

PKVHYf LRTQ PPSLDIBVIXXJVGPDTVHKCAEAGANMrVSG SAI M 

RSEDPRSVINLLRNVCSBAAQKRSIiDR 


6289 


1 


743 


^Yl>CRGLVGNIiU3ASG^ 
MLDSGADYIiHlADVMIXSHF^ 

HJ*OTV5 KPEQ WKPMAVAGANQYTFHLEATENPGALI KDI RRNGM 
KVGIAIKPGTSVF^ttJVPWAl^II>MAL^^ 

PKVHWIiRTQFPSIiDI EVDGGVGPDTVHKCIAEAGANMIVSGSAIM 
RSEDPRSVINLLJRNVCSEAAQKRSLDR 


6290 


3 

• 


1856 


TlGRWiiJt/3VYETVAPTl*AC^ 

VPQSJ^LKGITKHALNHHPPPBKLERISPTSDSHEKDTSSQSKS 
DITRJBSSFTSADTGNSI^AFPSYTOAGISTEGSSDFSW5YGEI,D 

QNATEKVQTMFTAIDEIjLYE qki»s vhtkslqe e cqqwt as f phl 
RILGRQIITPSEGYRLYPRSPSAVSASYETTLSQERDSTIPGIR 
GKKLHFSSS YAHKASS IAKSSSFCSMERDEEDS I IVSEGIIEEY 
I»AFDH I DIEEGFHGKKSBAATEKQKLGYPPIAP FYCMKEDVLAY 
VFDSVWCKWSCMEQLTRSHWEGFASDDESNVAVTRPDSESSCV 
LSBLHPLVIJ?RVPQSKVLYIT8N^ 

T^I^PRm^IiMCICLI^I^DKLl^PGSSTIIiSTRlWPKRAVEFS 
TSSLSYTVQSTRRROTPPRTLHPISTSHSCAETPRSVEEIIiRGA 
RVpVAPDSl^SPSPTPIiSRNNIiLPPIGTAEVEHVSTVGPaRQMK 
PHGDS SPJVQSAVVDEPNYQQPQKRLIjLPDFFPRPNTTQS flldt 
QYRRSCAVEYPHQARPGRGSAGPQIiHGSTKSQSGGRPVSRTRQG 
P 


6291 


1732 


602 


LVaKMASSASART PAGKRVINQEEIiRRIMKEKQRLSTSRKRIBS 
PFAKYNRLGQLS CALCNTPVKSBLLWQTHVIjGKQHRBKVAELKG 
AKEASQGSSASSAPQSVKRKAPDADDQDVKRAKATI.VPQVQPST 
SAVTrXNFDKIGKEFIRATPSKPSGI»SI»LPDYEDEEEEEEEEKGD 
GERKRCDASKPLSDAQGKEHSVSSSREVTSSVLPNDFFSTNPPK 

ft r\f -r tyii o f*» CtBiflkD "r UTMTWt JW DOI/NTA 17 AT.'D WZ WJTTmOB'Un & I? V 
Jz.tr A. X PHoLro JUS JUu£ J.JHiS CVV V ISKHISW X AtSAlif kajK r L/Utr o v iv/uc V 

RKVDAPKDQMDKEWDEFQKAMRQVNT I SEAIVAEEDEEGRLDRQ 
IGElDEQIBCYRRVEKr^NRQDEIKNKLKEILTIKELQKKEEEN 
ADSDDEGBLQDMiSQDWRVKGALL 


6292 


1835 


1142 


TCPGAMKIWAPWTRFYSNSCCI>CCHVRTGTIUXjVWYI»I inaw 

jjLILLSALADPIXJYNFSSSEIjGGDFEFMDDANMCIAIArSLI.M I 

h I CAMATYGAYKQRAA W I IPFFCYQIFDFALNMLVAITVLIYPN 

SlQiCTIRQLPP)WPYTOD\nteVNPT(^V^ 

MSCVWNCYRYINGRNSSDVLVYVTSJ^ 

AKEPPPPYVSA 


6293 


2382 


1035 


FWCrTLGfVDVHPIGWCAINSKILVPPRTIHAKFTDWKGYLMKRI> 

VGSRTLPVDFHIKMVESMKYPFRQGMRJiEVTO 

DTVIGGRLRIiLYEDGDSUDDFWCHMWSPIjIHPVGWSRRVGHGIK 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

ami r\r~i :*/-■"■» r-i 
CUUXilsJ ctCJLCL 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C-Cysteine, p=Aspartic Acid, E= 
Glutamic Acid, F=»Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K=t.ysine, 
L-Leucine, M=Methionine, N^Asparagine, 
P-Proline, Q^Glutamine, R-Arginine, 
S^Serine, T=Thr eon ine , V=Valine, 
W=Tryptophan, Y«Tyrosine, X-Dnknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=poaoible nucleotide insertion) 








MSERRSDMAHHPTFRKIYCDAVPYLFKKVRAVYTBGGWFEEGMK 
LEAIDPLNLGNICVATVCKVLJ^^ 

HASSHAI FPATFCX5KNDIBLTPPKGYEAQTP1WEHYLBBCTKSKA 

APSRLFNMIX!PJ^GFKVGMKLEAVDrj^BPIU J ICVATVKRVVHRL 

LSIHFIXJWDSBYIXiWVDCBSPDiypVGWraLTGYQLQPPVAAEP 

ATPLKAKEATKXKJOCQPGKiauattPPTK^ 

PQGARKISSEPVPGEI I AVRVKEEHLDVAS PDKASSPELPVSVE 

NIKQETDD 


6294 


354 


1814 


AQLTTRGRTVAGGVRVf I PSPFPDLPJjYS CCLGTDRGFPELSHHC 

KNVIATASDYD{^ITNIRPSFDVSPVVAGLIGASVLVVCV9VT 

VFVWSCCHO^ABKKHKNPPYKFIHMI^ISIYPETLSNKlffi 

VRRDF0X3 PGREGGRRNLLVDAAEAGLLSRDKD PROPS SGSCI DQ 

LPIKMDYGBBliRSPITSLTPGBSKTTSPSS PEEDVMLGSLTFSV 

DYNFPKKALVVTIQEAHGI^VMDDQTQGSDPYXKMTILPDKRHR 

VKTRVLRKTIJ)PVFBETPTFYGIPYSQLQDLVLHPLVI^ 

RDIJVXGEVMVPLRGVDPSlGKVQLTRDI I KRNIQKCISRGELQV 

SLSYQFVAQRMTVWLKARIu^QKMDIAGLSGNPY^ 

R I AKKKTHVKKCTLNP I FNES F I YDI PTDLLPDI S I BFLVI DFD 

KTTKNEWGRLI IX1AHS VTAS OAEHWREVCRS PRKPVAKWILSLS 

BY 


6295 


279S 


617 


VSSALLTGATSGSnAAKSEGASASPI^ClMAVAMDRPDEGPPAK 
IWLSSSESPQRDPPPPPPPPPLLRLPLPPPQQRPRLQEETEAA 
QVIJu^MRGVGl^PAIiPPPPPYVILEBGGIRAYFTLGAECPGWDS 
TIESGYGEAPPPTESJbEALPTPEASGGSIiBIDFQWQSSS FGGE 
GAJjETCSAVGWAPQRLVDPKSKEEAI I IVBDEDEDSRESMR83R 
RRRRRRRRKQR RVKRBSRERNAERMES 3XQALED I QLDLEAVNT 

kagkaflrlkrkfiqmrrpflerrdli IQHIPGFWVKAFLNHPR 

ISILINRRDKDIFRYLTNLQVQDrj^ISNKJYKMKLYFOTNPYP-r 
NMVIVKBFQRNRSGRLVSHSTPIRWHRGQEPQARRHGNQDASHS 
FESHFSNHSI^EADRIAEIIKNDLWVNPLRYYIiRERGSRIKRKK 
QEMKKRKTRGRCEWIMEnAPDYYAVEDIFSEISDIDETIHDIK 
ISDFMBTTI^FKTTDNEITDINENICDSENPDHNEVKHNETTDN 
NESADD}IETTDWJESADDNNENPEDN>fKNTDDNKENPNNNENTY 
GNNFFKGG FWGSHGNNQDS3DSDNBADEASDDEDNDGNEGDNBG 
SDDDGNBGDNEGSDDDDRD I BYYEKVI BDFDKDQADYEDV1 E 1 1 
SDBSVEEEGIEBGICXJDEDIYEBGNYEKEGSEDVWEEGEDSDDS 
DLBDVLQVPNGWANPGKRGKTG 








6296 


727 


1199 


RHCGCDAQGACBSI^PTGTSSPVTARNAIPEARCCV^LJDGTTV 
EA\mPARERIiARKEIjRQKP^O^FSRDSAYSSNKDSTCLLTERI>T 
IX5TSLQFPSPFSGTISFGSFSDSGIFPLGSOCCLGFQQFSISGK 
KWAL IHKRVRLSVFGARWGR I Y FGK 


6297 


1 


922 


QRAAAAS PSSCGPRGABYGALMAMEGYWRFLAI^SAIiLiVGFLS 

VIPALVWVLHYRBGIX^^SAIiEFNWHPVI^ 

VYRLPWTWKCS KLLMKS IHAGLNAVAAI LAI ISWAVFENHNVN 

NI ANMYSLHSWVGLIAVI CYhhQhLSGPS VFLL PWAPLSLRAFI, 

MPIH\nrSGIVIFGTVIATAIJ^LTBiCLIPSLRDPAYSTFPPEGV 

FVWn^LLILVFGALIFWXVTRPQWKR^ 

GARGSMPAYSGNNMDKSDSELNNBVT^ 


6298 


3 


985 


SVPLRRIiSLrSGTLC^lAGTTrKMAVARL^ 
PFGPHRGLSVLLARIPQRAPRWLPACRQKTSLSFLNRPDLPNLA 
YKKT.KGKSPGIIFIPGYLSYMNGTKAT.AIEEFCKSLGHACIRFD 
YSGVGSSDGNSEESTLGKWRKDVLSI IDDLADGPQILVGSSLGG 
WLMLHAAIARPEKWALIGVATAAI*raVT 
KGVWSMPSKYSEEGVYNVQYSFIKEAEHHCLLHSPIPVNCPIRL 
IiHGMKDDI VPWHTSMQ VADRVLSTDVDVI LRKHSDHRMREKADI 
QLLVYTIDDLIDKLSTIVN 
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Anino acid segment containing signal peptide 
{A=Alanine, C^Cyateinc, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P= Proline, Q=Glutamine, R=Arginine, 
S=5erine, T-Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *»Stop 
Codon, /-possible nucleotide deletion, 
\opossible nucleotide insertion) 


6299 


512 


B14 


ECDLEGIMPNVTISI^LPTNGSPLQDILWPCVTSIiDSAILTSS 
S I nAMDDS AFS GP YKFP FT PPLESPNIiCFYTSQ VP VP PILG FYQ 
MKEEKVQLRNNH 


6300 


121 


| 692 


AAPS CWSQRGVPAAGTPSS PRLLVSPJ\AAPSAG PWGAWRQGARA 
AQSPFSIPNSSSVPYGSQDSVHSSPBDGGGORDRPVGGSPGGPR 

ITYNEDVIiS KDAfiKCAICLEELQQGDTIARIiPCI»CIYHKGClC>B 
WFKVNRSCPEHPSD 


6301 


616 


284 


GKFVPVNWEPPQPLPFPKYLRCYRClJ^rKKi^CLLGiDICLTP 
AGSSCITl^KKNSSGSDVMVSIX^KEQI^IX^NTRTSPVSGFW 
IFSQYCFLDFCNDPQNRGLYTP 


6302 


490 


745 


I FGFLHLFHNEHS PLLVCALFAHVFFSSSCGSS VAIiHSDPCliIiS ' 
PVLI^CIjPGDIiRPLiDEIiYAQKLKYKAI SBBLDHALNDMTS L 


6303 


2 


1951 


yv^bygggllwqswqekhpgqalssepwnfpdtkeeweqhysql 

ywyyleqfqyweaqgwtfdasqscdtt^ 

kvdlvs plsspimgdndssgtsdkdhseildgisniklnseevt 

qsqldsctshdghqqlsevsskrecpasgqseprkggtnbb3ns 

sgntmtdppaedsqkssgantskdrphasgtdgdeseedppehk 

psklkrsheldidenpasdfddsgsllofkygsgqkyqgi pnps 

imqvryiiek^klkskyldmrrqikmknkhifftkksekpffkk 

skilskvekflthvnkphmeeasqhssshdnohdasrscdsbeq 

dwsvkxgddixktnnpepektf^ssagelete^ 

pdeork^m3e\ap0srqaeteaevkkxknkkknkkvngi^peiaa 

vpelaxywaqryrlf^fddgikijiregwfsvtpekiaehiagr 

vsqsfkc1>vvvdafcgvggntiqfaltg^viaididpvkiala 

rnnaevygiadkjeficgdfij^lasflkadwfi^ppwgapdya 

taetfdirtmmspixsfeifpjbskkitnnivyflprnadidqvas 

LAGPGGQVE IEQNFLNNKLKTITAYFGDLIRRPASET 


6304 


1 


143 B 


HRARVDRSRESPGGDI^HPGRVRRDITI>SGHPRLST , UHVVIJjRE 

DEVGDPCrnci^HPQHGSPIQETQSEVVTLVSPLPGSDMAA^ 

NRATS GLTLWPHTAEGRDLt^GAENRALTGGQQAEDPTIASGAYQ 

WPGSVEKLQGSWCDAETLLSSSRTX^APPWLTDBDVQMIaRl.L 

AQSEVVDKARVPAHGQVLQVGFSTEAALQDIjSSPRIiSQLCSOGIi 

CXSLIKRPGDIJBVI^FHVDRVWJLRRS^ 

TDGGARPVXWV^PDVQHLSDPDm^SIJVI^I^YQAbLAIISCN 
HPGQAP CPGIHHTOWARIiALFDFIJXJVHDRIiDR D 
PCVEE RLREKCRNPABt jRLVH ILVRS SDPSHIiVYIDNAGNLQH p 
EDKLNFRlJ^EGIIX5F?BSAVK\rLASGC 
SQGGAQGLKQVLQTLEQRGQVliliGHlQKH^TLFRDEDP 


6305 


39 


420 


nmiwrgrstyrprprrsvpppki^igpmlepgdeepqqeepptes 
rdpapgqerebdqgaaetqvpdiieadiiqei>sosktgdbcgdgpr) 

VQGKIIiTKSBQFKM PEGR 


6306 


1 


1874 


PTRPSKVKVPHTFLIHSYTRPTVCQACKKLLKGLFRQGIiQCMC 
PkCa uliUtUi LKVrCliArXiurnAJb JiNbUVPHKKAi Ure>aADKSfiJMu 
BS E0SGVI PGSHSEN ALHASBEE EGEGGKAQSSLGYI PLWRWQ 
SVRRTTRKSSTTLREGWVVHYSNKDT^ 

NNTTNR Y YKE I PLS E 1 LTVES AQNFSLVPPGTNPHCFE I VTANA 

TYPVGEMPGGTPGGPSGQGAEAARGWETAIRQALMPVILQDAPS 

APGHAPHRQASLSISVSMSQIQEHVDIATVYQ1PPDEVLGSGQF 

GVVYGGKHR KTGRDVAVKVTDIOjRF PTKQESQLRNE VAI LQSLR 

HI»GIVI^E(^FBTPEKVFVVMEmiGDMLE^ 

TKFLITQ I LVALRHLHFKNI VHCDLKPENVLLAS ADP FPQVKLC 

DFGFAR1 IGEKSFRRSWGTPAYIiAPEVIJJ30GYNRSIJ»W 

V I MYVSLSGTFPFNEDED INDQ IQNAAFMYPASPWSHI 8 AGAID 

L INNLLQ VKIWKRYSVDKSI^HPWI^EYOTWT^ 

YITHESDI>ARWEQFAAEHPI^SGLPTDRDLGGAC^^ 
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Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F*=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L^Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S^Serine, T=Threonine. V= Valine, 
W=Tryptophan, Y=Tyrosine, X- Unknown, *»stop 
Codon, /^possible nucleotide deletion, 
\=possib!e nucleotide insertion) 








LAKRXSVI, 


6307 


2136 


589 


CFLLPRGRDPEPPEAGAAAPOu^GAPDMSFRKVVRQSKPHHVFG 
QPVKND0 CYED I RVSRVTWDSTFCAVNPKFLAV1 VEASGGGAFL 
VI*PIiSKTGRIDKAYPTVC»fn , GPVLDIDWCPHNDEVIASGSEDC 
TVMVWQIPEWGLTSPLTKPVVVLEGr-maiVGI IAWHPTARNVLL 
SAGCDNVVL I WNVGTAKBLYRLDSL.HPDL I YNVS WNHNGSLFCS 
ACKDKS VR I IDPRRGTLVAEREKAHEGARPMRAI FLADGKVFTT 
GFSRMSBRQIALWDPENI^BPMATiQBLDSSNGAriLPFYDPDTSV 
VYVC0KGDSS IRYFEITEEPPYIKFLNTFTSKEPQRGMGSMPKR 
GLEVS KCE IARFYKLHERKCEP I VMTVPRKSDL FQDDLYPDTAG 
PEAALEAEEWSGRDADPILISIiRBAYVPSKQRDLKISRRNVLS 
DSRPAMAPGSSHtt^ASrTTAAOATPSGSIJ^^ 
QELRALRALVKECX3DRICRIJSEQI/5RMENGJDA 


6308 


2 


111B 


GRPTRPEKMIiSLVLHTYSMRyiinPSVVlIfiT 
I^SAFT^PARFYQALDDRLYC^QSMVLFEFENYTGVQILIjYGDL 
PKNKENI I YLANHQSTVDW IVADILAXRQNALGHVRYVLKEGLK 
WLPIjYGWYFAQHGGIYVKRSAKFNEKEMRNKLQSYVDAGTPMYL 
VIFPEGTR YNPEQTKVLS AS QAPAAQRGLAVIiKHVLrPR I KATH 
VAFIKMKNYLDAIYDVTVVYEGKDDGGQRRBSPTMTBP^ 
KIHIHIPRIDKKDVPEEQEHMRRWIiHERFEIKDKMIjIEFYESPD 
PERRKRFPGKSVNSKI^S I KKTLPSML I LSGLTAGNDMTDAGRKL 
YVNTWIYGTLLGCLWVTIKA 


6309 


220 


563 


LVAEVKB PCSI*PMLS VDMENKENGS VGVKHSMEWGRPPDPADWA 

VMDVVNYFRTVGFKEQASAFQEQEinGKSI^ 

lO^PALKIYEYHVKPLQTKHLKNNSS 


6310 


36 


979 


GPRCMKFLILSSVNCETIiRIGKAWPQSSGQERYWTPRTHSSASE 
AQRGSIJVEIJvIVAAAGLWADCDQPLYDCPMCGIjI CTNYHILQEHV 

dlhleens fqqgmdrvqcsgdlqlahqlqqeedrkrrs besrqe 
iebpqklqrqygijwsggykqqqlrn^ievnrgrmppsefhrr 

KADMMESLALGFDDGKTKTSGI IBALHRYYQNAATDVRRVWIiSS 
VVDHFHSSLGDKGWGCXySfRHF<HffiI*SSI»I^ 
CIPKIQSMIFJWfKEGFDPQGASQLIIRiC^TKAWIGACEVYIL 
LTSIiRV 


6311 


1 


675 


PVWWNSCBGPRIJLAAARTGHGWRRAPJLACI^EPRVKAAWXTL 

AS KLKRDDGT-iKGS R TAAXAS DS TRRVSVRDKLIiVKBVARLEANIi 

PCTTCKVHFPDPNIG^ICFOLTVTPDEGYYQ^ 

MVPPK^CIVITUWHPNITETGBICLSLI^^ 

KDVVWGHgSLFTDLLNFDDPmiEAAlfflHLRDKEDFRNXVDDYI 

KRYAR 


6312 


213 


1400 


GDEi VKRKAGMKMLPG VGVFGTGSSAR VLVPLLRAKG FTVEALW 
GKTEEEAKQIJu^EMinAFYTSRTODIIXHQDVDLVCISI PPPLT 
RQISVKAIX3IGKNWCEKAATSVDAFRMVTASRYYPQLMSLVGN 
VLRFT.PAFVRM KQLI S EHYVGAVM I CDARI YS GSLLS P S YGW I C 
UaljPgAfJi3lia±T%JX I IVDIjI*THLTBRKAEKvHGIiL AT J?" VKUWAA 
I RG I RHVTS DDFC FFQMIjMGGGVCSTVTIiNFNMPG^ 
GSAGRLVARGADLYGQKNSATQEELLLRDSLAVGAGLPEQGPQD 
VPLLYLKGMVYMVQALRQSPQG<X?DRRTWDRTPVSMAASFEDGL 
YMQSVVimiKRSSRSGEWEAVEVLTEEPDTNONIX^ALQRNNL 


6313 


2 


2071 


QRSGAARLAFl.PSPFSPACVHRJSPIjSFHGCWYFVWFMPLGVL - ' 

FIIRRRAHGCITi3(^SFVEQPTAMFAEETMBCLQEFPEHHKMILD 

RLNEQREQDRFTDITLIVDGHIIFKAHKAVLAACSKFFYKFF^ 

TQEPLVEIEGVSKmFRHLIBFTYTZu^IQGEEBAlTOWKTW^ 

FLQMLEAIKAIxEVR^KENSAPLEENTTGKNEAKKRK^ 

TRSLPS AESEPVEIEVE I AEGT IEVEDEG IETLE EVAS AKQS VK 

YIQSTGSSDDSALAIJJU)ITSKYRQGDPJCGQIKEIXX^PSDPTSK 

QVEGIEIVKI^I>SmraLFHCEKCNRSFKT,PYHFKEHMKSHSTE 
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Amino acid segment containing signal peptide 
(A=» Alanine, C= Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, F- Phenylalanine, G=*31ycine, 
H=Hietidine „ I=Isoleucine, K-Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P= Proline, Q=Glutamine, R=Arginine , 
S=xSerine, T=1*hreonine , V^Valine, 
W=-Tryptophan , Y°Tyxosine , X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








SFKCEICNKRYIiRBSAWKOHI^CYHLEEXSGVSKKQRTGKKIHVC 
QYCEKQFDHFGHFKEHLRKKTGEKPFECPNaiKRFARNSTLKCH 
LTACQTGVGAKKGPJCKLYECQVraSVFNSWDQPKDHLVIHTGDK 
PMKTTLCDLWFMQGNELRllHLSUA^ 

TEPVTSWT1 IEQVGKNTEP7LPIXQVQVDSAQVTVEQVHPDI*LQDS 
QVHDSHMSBLPEQVQVSYT»EVGRIQTEEGTEVHVEELHVERVNQ 
MPVEVQTEUl^EADLDHVTPEIMNQEERESSQADAAEAAREDHED 
AEDLETKPTVDSBAEKAENEDRTALPVLK 


6314 


2 


2071 


QRSGAARLAFliPSPFSPACVHRSPLSFHGCWFYPVVVFMPI.GVL 
FHPJUiAHGCn'LSCSSFVEQPTAMBAEETMCLQEPPEHHKMrLD 
RLNEQREQDRFTDITLIVDGHHFKAHKAVIiAACS KFFYXFPQEF 
TQBPLVEIEGVSKMAF1UILIEFTYTAKLMIQGEEEANDVWKARE 
PLQ^!LEAIKALEVPJ^KENSAPLEEmnreKNBAKKRKlAP^SNVI 
TESLPSABSEPVEIEVBIAF^TIEVEDEGIETLEEVASAKOSVK 
YIQSTGSSDDSALALIiADITSKyRQGDRKGQIKEDGCPSDPTSK 
QVEGIEIVEIiQItSHVKDLFHCEKCNRSFKIiFYHFKEHMKSBBTE 
SFKCE I CN KRYI.RES AW KQHLNCYHLEEGGVS KKQRTGKKIHVC 
QYC^KQFDHFGHFKEHLPJOm5EKPFECPNCHERFARNSTI*KCH 
LXACQTGVGAKKGRiaa,YECQVCNSVFNSWDOFKDHLVIHTGDK 
PNHCriX^)LWFMQGNELRRHLSDAIINIGKlil>VTEEVLSVETRVQ 
TBP VTSMTI I EQVGKVK^PLIX2VQVDSAQ VTVEQVHPDLIjQDS 
QVHDSHMSELPEQVQV9 YLEVGR I QTEEGTEVHVESLHVERVNQ 
MPVEVQTELLEADIiDHVTPE IMNQEERESSQADAAEAAREDHED 
AKDIiBTKPTVDSEAEKAENEDRTALPVLE 


6315 


1 


1015 


I^TAVNVVTTLVLISYCPTATEEAPYVmrLI^AJ^LFIYQSLDA 
IIXJKQARRTNS GSPLGBEFDHG CDSLS 7VFMAVGAS I AARLGTY 
PDWPPSCSPIGMWFYCAHWQTYVSGMLRFGKVDVTEIQIALVI 
VPVLSAFGGATMVfDYTIPILEIKLKILPVL3FI*GGVIFSCSNYF 
HVILHGGVGKlJGSTIAGTSVIiSPGIillGLIIIIiAIMIYKKSATD 
VPEKHPCLYIIMFGCVFAKVSQKIiWAHm'KSELYliQDTVFl^P 
GLLFIJ^YPNNF IDE YWLWMAMVIS SFDMVI YFS ALCLQ IS RH 
IjHLN I FKTAaiO^PEQVQVLSSKSHQNNHD 


6316 


1503 


792 


VSAGJldl^3IMGGTTSTRRVTFFJU3ENEN I TWKGI RI>S ENV I DR 

MKESSPSGSK5 QR YSGAYGASVS DEELKRRVAEBLALBQAKKE S 

EDQKRLKO^KEI^RBRAAAKEQLTRAIIjRERICSEEERAKAKHIi 

ARQLBEKDRVLIO£QDAFYKEQIARLEE^ 

EEVEAKFKRYESHPVCADLQAlCCLCXrYRENTHO/r^ 

MHCVNHAKQSMLBKGG 


6317 


102 


839 


PEAQTSAVLAPJBKXJHLPTMPJHE^ 

FDYMPKI^tlGNSSVGKTSFLFRYADDSFTSAFVSTVGIDFKVK 
TVFKNEKRIKXQIWETAGQERYRTITXAYYIIGAMGP tLMYDITN 
BESPl^VQDWSTQI KTYS1TONAQVILVGNKCDMEDERVISTBRG 
QIILGEQLGFEFFETSAKI^INVKQTFBRLVDIICDKMSESbETD 
PAITAAKQNTRIiKBTPPPPQPNCAC 


6318 


1765 


733 


PWHPLRTI»PI^HPHPKPPRAEGKEGAD&MSHIj 
IX^PIiSPFPI^AGSWHRQ^^>RSSLRFPIT^^SAGAPCKAAGRWNI 
LAP VRRDRVIjAELP QCIiRKBAAUIGHKDFHPRVTCACQEHRTGT 
VGFKJISKVIVVGDI^VGKTCLrrmFCKDTFDKNYKATIGVDFEM 
ERFEVLG I PFSLQLWDTAGQERFKCIASTY YRGAQAI IIVFNLN 
DVASLBHTKQWLADALKENDPSSVLLFLVGSKKDLSTPAQYALM 
KKDAIX)VAQF>MKAEYWAVSSLTGENVRBFFFRVAAI»TFEANVLA 
ELBKSGARRIGDVVRnJSDDSNLYLTASKKKPTCCP 


6319 


88 


717 


AATMRLNC^TLIJjGKKVVLVP YTS BHVPS R YHEWM KS EELQRLT 
ASEPLTLEQEYAhKK^^EDADKCTFrVLDAEKWQAQPGATEES 
CMVGDVNLFLTDU5PLTLGEIEVM lAEPSCRGKGliGTEAVLAML 
SYGVrrLGI.TKFEAKIGQGNBPSIRMFQKLHPEQVATSSVFQRV 
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Amino acid segment containing signal peptide 
(A^Alanine, OCysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q^Glutaraine, R«Arginine, 
S=Serine, T»Threonine, V= Valine, 
W=Tryptophan, Y^Tyrosine, X=0nknown, +=>Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








TIiRLTVSE S KHQWLLEQTSHVE EKPYRDGS AE P C 


6320 


90 


1111 


RPRTGREKVAMAAVDSFYU/YRE IARSCWCYMEALALVGAWYTA 
RXSI TVI CDFYSLIRLHFIPRIX?SRADLIKQYGRWAWSGATDG 
IGKAYAKELASRGLNI ILISRNEEKLQVVAKDIADTYKVETDI I 
VADFSSGRE I YLP IREALKDKDVG ILVNNVGVFYPYPQ YFTQI>S 

PTPQLAAFSASKAYIiDHFSRALQYEYASKGI FVQSIilPFYVATS 
WAPSNFXHRCSWLVPSPICVYAHHAVSTLGISKRTTGYWSHSIQ 
FBFAQYMPEWLWVWGANILNRSLRXEALSCTA 


6321 


| 141B 


341 


HRKAAijGAIiMAGRIJ^KAIJmvSI^IJU^VTIRSSRCRGrO^ 
RNS FSSS W PHLNTNVMSGSNGSKENSHNKARTS PYPGS KVERS Q 
VPNEKVGWLVEWQDYKPVEYTAVSVIiAGPRWADPQISBSNPSPK 
FNE KDGHVERKSKNGL YE IENGR PRNPAGRTGLVGRGLLGRWGP 

rtHAAJJfXJ. I KWKi<D^50«KXpUiPVSGKH X JX2fv AIxuaUAAiEWA 
I PGGMVDPGE KI S ATLKRBFGEBALNSLOKTSAEICRR I KK KJUHK 
LKS QDHLV I YKGYVDDPRNTI3N AVWETEAVNYHDETGE IMDNLM 
rj^DDAGKVKWVDINDKLKLYASH^QFIIO^VAEKRDAHWSEDS 
BADCHAL 






JlVoj 


wnnTT irtn7T7C'(!D , iiunDiivr cot x at-matnirvitr , *T»tJrtr*i*in** m *'tt\Tama 
HUol JbJ\JNVr.£»£>Kl VyFtlr l^rljl»aiJLrWbVDVGRHP t3WTGHV3X3 

WS INCCI^EGSCX3BEVISSE1)IGASXF1^KKVLYYADA1,TEI 

AFWPS PVES LTDSLESNI SDQDSDSNMDLMPG ILKDPSLTLEL 

FPNHTDNLNSSQRLSPSSRMRKLPQGRPVPPLGPKTRVSWWVE 

RYDDIENFPLSELmSISTGVETTANSSTSIJISTTLEKEVpVIP 

IROPLNTGLPRIICTOGATGKFNMVIPLVD^ 

VINI C^KRLKSIKYSPPHVRRKQKITDIVNKYRNKQLEPEFYT 

SliFQEVGLKNCSS 


6323 


1 


656 


PASTTDGAQEARVPLDGAFWIPRPPAGSPKGCFACVSKPPALQA 
PAAPAPEPSASPPMAPTLFPMESKSSKTDSVRAAGAPPACKHLA 
EKKTMTNPTTVTEVrTOTTEVNDYYLWS IFNFVYLNFCCW5FIA 
IjAYS LKVRDKKIiLNDLNGAVEDAKTDR^ GLAAS C I MLW 
MALS VIATHRGU?SSAS ILVARPHDWSrrERPQVTFRERCPAL 


6324 


1 


2061 


EGAGMRRCPCRGSLNEARAGALPAAARMGLEAPRGGRRRQPGQQ 
RPGPGAGAPAGRPEGGGPWARTEGSSIiHSEPERAGUJPAPGTES 
PQABFWTDGQTEPAAAGLGVETERPKQKT^ 

rn^LETHGSG/rQPERVKSWADNLWTHQNSSS LQTHPEGAC PS KE 
PSAWSSWK^YTIX^RTG^DIEGPWTKPYTDGSQiatQDTEAARK 
QPGTGGF^IQQDTIXJSWTQPSTDGSG^I7^TDCXI^EPEIX3Pr^ 
EPEPGELLTHLYSHLKCSPLCPVPRLI ITPETPEPEAQPVGPPS 
RVEGGSGGPSSASSFDESEDDWAGGGG^DPEDRSGSKPWKKL 
KTVLKYS PFWS PRKHYPHVQLSGHAGNFQAGEDGRI IiKRFCQC 
EQP^LEQI^KDPLRPPVPAYYGMVU)IX3QTFNQMBT)LL^FEGP 
3 IMDCKMGSRTYLERELVKARB31PRPRKDM 
EFJIAQGAVTKPRYMQWRETMSSTSTLGFRIEGI KKADGTCNTNF 
KKTQAiEQVTKYLEDFVDGDHVIT^KYVACI^ELREALEISPFF 
KTHEVVGSSLLFVHDHTCIJUCV^ 
WAEGNREDGYLWGLDNMICLLQGLAQS 


6325 


165 


944 


GLRDP FRRiOiRLKPQVKMSNYVNrWWPGSPQBKDSPSTSRSGGS 

SRLSSRSRSRSFSRBSRSHSRVSSRPSSR5RRSKSRSRSRRRHQ 

RKYRRYSRSYSRSRSRSRSRRYRERRYGFTRRYYRS PSRYRSRS 

RSRSRSRGRSYCGRAYAIARGQRYYGFGRTVYPEEHSRWRDRSR 

TP^RSRTPFRLSEXDPMEUiEIAKlNAAEALGTrNI 

VPSAKBTSRGIGVSSNGAKPBVSIIX3LSEQNFQKANCQI 


6326 


23B 


680 


GEPS PATO^KPS ATGAGVLHOHFSSGH I YVLMGI*LP P PWTIS FT 
VQTTLQPPGGLPAAPVSGRMAFBPVGRDLARRMVPRAGKRTQTI* 
GARRVAAG<^PLPKDRRPKSGERIJnrrVAPCWEFVLPSVSLTA 
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Amino acid segment containing signal peptide 
(A= Alanine, ^Cysteine, D=Aspartic Acid, B» 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H-Histidine, I-Isoleucine, K=I»ysine, 
L=Leucine, M= Methionine, N=Asparagine, 
P^Proline, Q=Glutamine, RaArginine, 
S^Serine, T-Threonine, V=Valine, 
W^Tryptophan, Y-TyroGine, X^Unknovra, *=Stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 








QAWGGVGQEASSGVP 


6327 


1 


1337 


SI^I^AGGSWMPTG^PAAPSTRAPKPSRSLSGSLCALFSDA 
DSGSGMKAKLPPGPGAVGREMTKRKKLQI>RKEKKQQKKKRKEEK 
GABPETGSAVSAAQCQGPTRELPBSG IQLGTPREKVPAGRSKAE 
LRAERRAKQEAERALKQARKGEQGGPPPKASPSTAGETPSGVKR 
LPEYPQVDDLLLRRLVKKPBRQQVPTRKDYGSKVSIiFSHLPQYS 
RQNSLTQFMSIPSSVIHPAMVRICLQYSQGLVRGSKARCIALliR 
AIXX^IQD xTTPPKEEIjSRDLVNKLKPYMS fltqcrplsasmhn 
AIKFTJ«KKTTSVGSSKREEEAKSHXRAAIDRYVQEKIVLAAQAI 
SRFAYQKISl^VILVYGC^SLVSRILQEAWTEGRREllVVVVDS 
R P WLEGRHTLRSLVHAGVPAS YLIilPAAS YVLPEVS TBBKDS KV 
GGEKV 


6328 


1030 


276 


HAHAK V-ITAAARGLGAMBBEMHTnAKlRAERGTGSS PRGPGCSL 
RHFACEQNI^RPDGSASFIWDTSVIiAGVYGPABVKVBKEI FN 
KATLBVTLRPKIGLPGVABKSRKRLIRNTCEAVVLGTLHPRTS I 
TVVUXtnteDAGSIJiACCIJJAACMAL^^ 

SDGTLVLDPTSKQEKKARAVLTPAI^SVKRiCLLMSSTKGLYSDT 
ELQQ^LAAAQAASQHVFRFYRBSLQRRYSKS 


6329 


3 


2016 


S S EVAAGGGTRS AMAEG3 GE WTVSATGAANGIJINGAGGTSA'IT 

SNPLSRKLHKIIjBTRIiDNDKBMIjEAL^^ 

RGDI ERKS LAINEE FVS I FKBVKEEIJ5S I S EDVQAMSNCOQDMT 

SRLQAAKEQTQDLIVKTTKI^SBSQKLBIRAQVAXIAFLS KFQLT 

SDEMS LLRGTRBGP I TEDFFKAW^VKQIHNDVKVLLRTOIQCyrA 

GLEIMEQMALLQETAYERLYRWAQSECRTLTQRSCDVS PVLTQA 

MKAI^DRPVLYKYTIjDEFGTARRS'I'VVRGFIDALTRGGPGGTPR 

PIEMHSHDPIJiYVGDMIJVHIJIQATASEKEHI^EALLKHVTTQGVE 

EfaQBWGHITEGVCRPLKVRIEQVTVAEPGAVLLYKrSNLLKP 

YBMTISGIVGNSATALl»TTIEEMHIiEjSKKI FFNSLSIjHASKLMD 

KVELPP PDLGPSSAMJQTIWLIfRBVLASHDSS VVPIjDARQADFV 

Q\^C^/IJJPIJjQMCn^SASNIX3TAI)MATFM 

EFTDRRLEMLQFQIEAHLDTLINEQASYVLTRVGLSYIYNTVQQ 

HKPEQGSLANMPNLDS VTLKAAMVQFDRYLS APDNIi I PQLNFL 

LSATVKEQlVKG^TBiVCRAYGBVYAAVMNPINBYKDPBNILHR 

SPQQVQTLLS 


6330 


11S1 


333 


PFYYTFYBl«CrFSRKMVAEKETI^UJKCPDKMPKR^ 
PVHQPHSLVSBGFTVKAMMKNSVVRGPPAAGAF1CERPTKPTAFR 
KFYERGDFP I AIJ8HDS KX5WKIAVfKVEIBKLDYHHYIjPIi FFDGIiC 
EMTFPYBFFARQGIHDMLEHGGNKII^VLPQLIIPIKNAI*^ 

RQvramjcvMm.wsAEMVGKAi»vpy^ 

NSGDGIDYSQQKRBNIGDLIQETLBAFBRYGGBNAFINI KYWP 
TYBSCTJiW 


6331 


3 


495 


QQGQRVRTRGRRACAS AT PliEGCVDLiS YPRTKAALLJCVAQMVTIj 
LIAFI CVRSSLWTNY SAYS YFEWTICDLIM I LAFYLVHLFRFY 
RVIiTClSWPLSEI^YLIGTLLLLIASIVAASKSYNQSGLVAGA 
IFGFMATFLCMASIWLSYKISCVTOSTDAAV 


6332 


1 


878 


.VTESNKFDLVSFIPI^RERIYSWNQYARQFIISWILVLESVPDI 
NLLDYTiPE I LDGLFQIIiGDNGKE IRKMCEWLGEFLKE IKKNPS 
SVKFAEMAN ILVIHCQTTDDIilQLTAMCWMREF I QLAGR VML PY 
SSGILTAVLPCLAYDDRKKS IKEVANVCNQSLMKIiVTPEDDELD 
BIiRPGQRQABPTPDnAXPKQEGTASGEWTPSLHIiTSCRGPREPD 
VI GVALGPHLSNQDYFMYVTHT I VAATQRSGSSGS PP FCRQDTG 
KI^TMATHSQLVKTGTGLEPRQAVSSSH 


6333 


3 


1467 


TRTPSEAEAGGESPO^CVSAAHSDttTAGKPVSLLAPLiPPRSAG 
QPLTFS PSGRQ PLRSLLVGMCSGSGRRRSSLSPTMRPGTGAERG 
GLWMGHPGMHYAPMGMHPMGQRAlMPPVPHGffllPQM^ 
MGQMPGMMSSVMPGWWSH^O^MQFAI^PGVNSMDVAAGTAS 
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Amxno acid segment containing signal peptide 
(A=Alanine , C^Cyeteine, D=*Aspartic Acid,' S= 
Glutamic Acid, F= Phenyl alanine , G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine. M=Methionine, N=Asparagine , 
P=Proline, Q=»Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon r /=possible nucleotide deletion, 
Vpossible nucleotide insertion) 








GAKSMVTliSlti^PDGRTyYYNTETKOSTWEKPDDLKTPAEQLIiSK 

CPW K3YKSD3GKP YYYNS QTKBSRWAKPKELEDtiEG YQNTIVAG 

SL1TKSNLHAMIKAEBSSKQKECTTTSTAPVPTTBI PTTMSTMA 

MEAAAAVVAAAAAAAAAAAAANAHASTSASN^ 

EVTS I VATWDNENTVT I STBE QAQLTSTPAIQDQS VEVS SNTG 

BE^SK^KTVADPTPKKEEEESQPAKKTYTHNTKBEAKQAFKE^ 

KEKRVPSNAS WEQAMKMI IHDPRySALftKLSBKKQAPHAYKVQT 

BKK 


6334 


- 17 


644 


GGNPSGRAAGFAAAftWPSSPbRVAWCSSNQr^MBAHNlI,SKR 

GFSVRSF^GTHViapGPAPDKPNVYDFKTrYDQMY^ 

ELYTQNGII.HMLDRNKRIKPRPERFQNCKDLFDLILTCKK^VYD 

C^n^LNSREQETCQPVHVVm^IQM^EATLGAFLICELCX?C 

IQHTEDMENBIDKLtiQEFEEKSGRTFLHTVCPY 


633S 


82 


S29 


AARARPGVLCC^IjLGAAIiGDQSRVEMSYIPGQPVTAVVQRVBIH 
KIJIQGENLIIiGPSIGCGIDQDPSQNPFSBDKTDKGIYVTRVSEG 
GPAR IAGXjQ IGDKI WWGWDWTMVTHIXJARKRLTKRS E EWRL 
LVTRQSLQKAVQQSMLS 


6336 


10D3 


438 


HEPASKGRAWGWMRI^VAAAISHGRVPRRMGI^PESRIHLLRN 

I^TGLVRHEl^IEAPWARVDEMRGYAEKLIDYGKI^DTNERAMR^ 

ADFWI»TEKDLIPKLF<IVIJVPRYKDQTGGYTRMW 

^VIEYKaNCLPPLPLPRRDSHLTLLNQLLOGLRODIiRQSQE/VS 

NHSSHTAQTPGI 


6337 


76 


524 


BGIQMLSVQPDTKPKGGAGC^ 

ACCDCmXSEVGSTLYXKANLI I^RDYl^FGVTGNCAACSKUI 

PAFBMVMRAKDNVYHU^CFACQICTQRFCVGa) 

TDYBEGLMKEGYAPQVR 


633B 


66 


1349 


APNS E SGTQG PLPT P ANIiF WTRRANP DPTTS MS ATDRMGP KAVP 
GLRIAI^LLGLGTPKSGVQGQEGLDFPBYEGVDRVIIWNAKNY 
KWVFKXYEVIAIiLYHEPPEDDKASQRQFEMRBIj I liELAAQVLED 
KGVGFGLVDSBKnAAVAKXIXSLTEVDSMYWKGDBVIEYDGEFS 
ADTrWJFLIjlJVLEDPV^ K 
I^EH YK^EDAAEEFHPYI PFFATFDS KGAKKLTLKLNE I DFYE 
AFMEBPVTI PDKPNSEEEIVNFA/EEHPJ^TIjRKLKPESMYETWE 
DmiK5IHIVAPAEKADPIX;FKFLETLKAVAQDNTEKPDL3 1 IWI 
DPDDFPLLVPYWEKTFDIDI^IAPQIGVVNVTDAnRLWMEKDDBS 
DLPSAEELEDWIjEDVLEGB INTEDDDDDDDD 


6339 


246 


1813 


NRCDRGGGGQABRQAGQG CRTQGAGPGFGFGHS ffsqgamkafh 
TFCWIiLVFX^SEAKFDDFEDEBDIVEYDUNDFAEFKDVNEDS 
VTBSPQRVI ITBDDEDBTTVELBGQDENQEGDFBBADTQEGDTE 
SEP YDDEE FEGYRDKPDTSSS KNKDPX TI VDVPAHLQNS WES YY 
LEXLMVTGLLAY I MNY I IGKNKNS RIiAGAWFNTHRELLESNFTL 
VGIOTTNKEATSTGKI^ENEniYNLWCSGRVCCEGMLIOLRFI, 
KRG5)IjLNVIjAPJ4MRPVSIX2VQI KVTMl^ 

v K L^JlUSny Duty £. r UoDKP tw GAK.I txbFLJij iiAJL Uia KT^E VTDGMM 
DTKMVHFLTHYADKIESVHFSDQFSGPKIMQEEGQPIiKLPDl'KR 
TLLLTFNVPGSG^ Y P KDMBALLPLMNM V I YS IDKAKKFRLNRE 
GKQKADKNRARVEENFUO,THVQRQBAAQSPJ^EKKPJu?KBRlM 
NEKBPBKQRRLEEAMjRRBQKBXRKKO^fKMKQlKVKAM 


6340 


2 


583 


EACAHTI^OTAFARIiGRARRRPWMSHRTSSTFRAERSFHSSSSS - 
SSSSTSSSASRALPAQDPPMEKAIiSMFSDDFGSFMRPHSBPIAF 
PARPGGAGN 1 KTLGDAYEFAVDVRDFS PED I IVTTStfNHIEVRA 
BKLAADGTVMNNFAHKCOLPEDVD^ 
HPHTBHVQQTFRTBIKI 


6341 


2 


6^5 


KMAVLSAPGIJIGPRII^I^SVGPAVQARGVHQSVATDGPSSTQ 
PAI^KARAVAPKPSSRGBYWAKIiDDIiVNWARRSSLWPMTFGLA 
CCAVEMMHMAAPRYDMDRFGVVFRASPRG^PVMIVAGTLTNKMA 
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H=»Hietidine f I*»Isoleucine, K= Lysine, 
I>-Leucine, M=Methionine, N=Asparagine , 
P^Proline, Q=Glut amine, R^Arginine, 
S=Serine, T=Threonine , V=?Valine f 
W=Tryptophan, Y«Tyrosine, Unknown, *=Stop 
Co don, /^possible nucleotide deletion, 
\s=possible nucleotide insertion) 








PALRKVYDQMPEPRyWSMGSCANGGGYYHYSYSWRGCDR IVP 
VDIYTPGCPPTAEIAIiLYGIIiQIiQRKIKRBRRLQIWYRR 


6342 


2 


1191 


DPRVRAMLATLARVAALRKTCXFSGRGGGRGLW^^ 
KPLEGVKILDLTRVLAGPFATMMLGDLGA^ 
TWGPPFVGTESTYYXSVHRNKKS IAVNIKDPKGVKI IKEIAAVC 
DVFVENYVPGKLSAMGLGYED IDE IAPHI I YCS 1TGYGQTOPIS 
QI^YmVASAVSGI^ITGPBVACMHIAAjra^GQKEAKR»G 
TAHGSIVPYQAPKTKIXSYIVVGAGtWQQPATVCKIIiDLPELXm 
SKYXTMHIiKVHNRKBLI KIIiSERFEEBXTSKWLYliFBGSGVPYG 
P INNMKNVFAEPQ\n jH ircLVMEMEHPTWKISVPGPAVR YSKF^ 
MSEARPFPMjGQHTTHILKEVLRYDDRAIGELLSAGVVDQHETH 


6343 


2 


936 


GTAMVSDKDELNIJ>VI VVDANP I WWGKQAUffiSQPTJjS KCIDAV 
MVLGNSHLFMNP5NKLAVIASHI QESRPLYPGKNORLGDPFGDP 
GHPPEFNPSGSKDGKYBLBTSANEVIVBBIKDIMTKSDIKGQHT 
BTLLAGSLAKALCY1HRKNKEVKDNQEMKSR I LVIKAAEDSALQ 
YMNFMNVI PAAQKQNILIDACVLDSDSGIiLQQAC3)ITGG 
P0MPSLIjQYI»LWVFLPDQIX2RSQI.ILPPPVHVDYI^CFCHR^ 
IBIGYVCS VCI*S I FCNFS P I CTTCBTAFKXS LPP VLKAKKKKLK 
VSA 


6344 


i 2508 


14^ 


OMPTATUSl^GYGfiASPGIAAPSLTPPQIATPHI^QPFPQATR 

G^LLGPPPVGVPMNPSQFNI>SGRNPQKQARTSS3TTPNPJCDSSS 

QTMPVBDKSDPPEGSEEAAEPRMDTPEDQDLPPCPEDIAKEKRT 

PAPBPEPCEA3ELP AKRLRS SEE PTEKEP PGQLQVKAQPQARMT 

VPKQTOTPDLLPFJUmQVIiPRFQPRV IjQVQAQVQSQTQPRI PS 

^^VQPKLQKQAQTQTSPEHIiVIiOX3KQVQPQLQQ3AEPQ10Q^ 

PQVQPQAHSQGPROVQLQQEAEPLKQVQPQVQPQAMSQPPRQVQ 

LQLQKQVQTQTYPQVHTQAQPSVQPQEHPPAQVSVQPPEQTHEQ 

PHTXJPQVSLLAPEQTPVVVHVCGIiEMPPDAVBAGGGMEKTLPEP 

VGTQVSHEBIQNESACGI^VGBCBNRAR^MPGVH 

LQS S DSRAFSTVPLTPVPRPS DS VSSTPAAT3T P3KQALQ FFCY 

ICTASCSSQQEPQDHMSBPQHQQJ^BIQHMSQACa^LLPVpR 

DVX»BTEDEBPPPRRWCNTCQLYYMGDI*IQHRRTQDHKIARQSIiR 

PFCTVaraYFXTPRKFVEffinCSQGHKDKA 

DHFI TVDAVGCFEGDEBEEEDDEDEEE I EVER BLCKQVR S RDIS 

REEWKGSETYSPNTAYGVDPLVPVMGYICPaCHKFYIISNSGAQL 

SHCKSIXfHPENLQKYXAAKNPSP^TTR^ 

TSSGRP PSQPNTQDKTPS KVTARPSQP PI^RRSTRLKT 


6345 


2 


3483 


PRVRTKLII^VNDKKJIYERVGGGPKRIK3IU)VEWEEMIEQLQEKV 
HBI^O#DTIJam,ISAK^ 

ANENAGLQECPRKG I KFQDADVAETPHPMFTKYGNSIiliEEARGE 

IP^ILEWVIQSQRGQIEEIiEHIiAE I LKTQLRRKENE I ELSLLQLR 

KQQATDQK^NIRDNVKMIKLHKQLVHKSNALSAMEGXFIQTX}EK 

QRTI^ISIII^^ArWDKUsfMQLKEQRLKCCSLEKQLHSMKFSER 

R r RELQDR INDLEKERELLKENYDKLYDSAPSAAHEEQWKLKEQ 

QLKVQIAQI»RTAI,KSDLTDKTEII^RI»KTETUX^HIiVQENRBL 

QLQYLEQKQQLDEIjKKR I KLYNQBND I NADELSEALIiLIKAOKE 

QKtfGDLS FLVKVDS E INKDLBRSMRELQATHAETVQELEKTRNM 

LIMOJIKINKDYWBVEAVTRJCMFJILQODYB^ 

ARIHKIiEAQLKDIAYGTKQYKFKPEIMPDnsVDEFDETXHIiERG 

ENLPE I H INKVTFS S E VLQASGDKE PVTFCTYAFYDFEIiQTTPV 

VRGI^PEYNFTSQYLVHVNDLFIjQYIQKNTITIjEVHQAYSTEYE 

TIAACQLKFHEILEKSGRIPCTASLIGTKGDIPNFGTVEYWFRL 

RVPMIX^IRLYRERAIGVICTITS^KGPEHKQSr^C^APKTAOL 

SSTDSTIX^LNBLHITIRCX'NHI^ 

DHDTAI I PS SNDPQFDDHMYF PVPMNMDLD R YLKSBSLS FYVFD 
DSDTQENI YIGKVNVPLI SLAHDRCI SGIFBI.TnHQKHPAGTIH 
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Amino acid segment containing signal peptide 
<A«Alanine, C^Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H«Histidine, I=Isoleucine, K=Lysine, 
k= Leucine, M=Methionine, N-Asparagine, 
P= Proline, Q^Glut amine, H=rArginine, 
S=Serine, ^Threonine, V^Valine, 
W^Tryptophan, Y=Tyrosine, X«Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VIIiKWKFAYIJ?P8GSITTEDIiGWFIRSfiEPEVVQRIiPPASSVST 
LVIAPRPKPRQRLTPVDKKVS FVD IMPHQSDVSQEGSUDEVKKN 
TEKMQQGKDDVSLLSEGQLAEQ SLAS SEDBTBITEDLEPEVEED 
MSASDSDDCI I PGPISKNI KQPS EKI RIEIIALSLNDSQVTMDD 
TIQRLPVECRPYSLPABBTPVSLPKPKSGQWVYYNYSNVIYVDK 
ENNKAKRDIL KAI IXJKQBMPNRS LRFTWSDP PEDBQDIjKCED I 
GVAHVDLADMPQEGRDL I EQNIDVFDARADGEG IGKIoRVTVEAIi 
HALQ3VYKQYRDDLEA 


6346 


2321 


533 


QDRRT»TiRT>KTQKTCQPTSTMSGSHTPACGPPSALTPSIWPQEIL 
AKYTQKBKSAEQPBPyYDEFGPRVYKEKGDEPGSSLLANSPLMK 
DAPQRI^WQAHI^FTHNHDVGDLTWDKIAVSLPRSBKI^LVLA 
GIPHGMRPQLWMRIxSGAIXJKKRNSBLSYREIVKNSSNDETIAAK 
QIBKDLLRTMPSNACFASMGS IGVPRIiRRVUtALAWLYPE IGYC 
QGTGMVAACLLLPLKBEBAFWMMSAI IEDLLPASYFSTTLLGVQ 
x oy K v LiKHLi-L vy iLiPRIjDKIjJLiQEHD I EI»IjITLHwFL»xAFASVV 
DIKLIJ^IWDLPFYEGSRVLFQLTIX?MLHLKEBRLIQSENSASI 
FNTLSDIPSQMEDAELLLGVAMRLAGSLTO 
IADQGQLI/aAGTLTNI^Q VVRRRTQRRKS TITAIJ^FGKDD 
KAKNI KQTKLVADUlEAILRVARHPOCTDPKNCSVVSRQLPGrtr, 
PNTALTPPTPLVGLYSLWQELTPDYSMBSHQRDHKNYVACSRSH 
RRRAKAI^FBOffiDDDEJ^FRKHDI ITI VSQKDKHCWVGBLNGI* 
RGWFPAKFVEVLDER3KE YS I AGDDSVTEGVTDLVRGTL CPALK 
ALFBHG£iKKPSIJXKSACHPNLFIBBAAGR£VBRX)PA5VYSRL 
CKTFRIJDEDGKVLTPEISLLYRAV^ 

ICVGLNEQVLHLWLEVLCSSLPTVEKWYQPWSFLRS PGWVQI KC 
ELRVLCCPAFSLSQDWEIjPAKREAQQPLKEGVMMIjVKHHLFSH 
DVDG 


6347 


2921 


533 


QDRRLLRLBLQKTCQPTSTMSGSHTPACGPFSALTPS iwpqeil 
AKYTQKER3AE QPEFYYDBFGFR VYKEKGDEPGSS I»LANS PLME 
DAPQRJiRWQAHLEFTBtfHDVGDIiTWDKI^^ 
GIPHGI^PQLWMRIjSGALQKKRNSEIiSYREIVKNSSNDETIA^ 
QIEKBIJ.RTMPSKACFASMQSIGVPRJjRRVIjRAIAWLYPEIGYC 
QGTGIWAACLnLFLEEEnAFWMKSAI IEDLLPASYFSTTLLGVQ 
TIXJRVLRHLIVOYLPRI^ICIil^KHDTRT.RT.T'rr.HWTTT.TTVPAQVV 
DIKLLLRI WnLPFYBGSRVLFQLTIXSMLHIJtBEBIi IQSEN3AS I 
FNTLSDI PSO^^EDABrjIiliGVAMRLAGSLTDVAVETORR 
IAIXXK31»I/3&GTLTNIiSQVVRRRTQR 

KAKNI KQTEIiVADLRFJUlLRVARlIFQCTDPKNCS WSRQLTOLL 
PNTALTP PTPLVGLYSLWQE LTPD YSME SHQRDHENYVACSRSH 
RRRAKALLDFERHDDDEI^FRXNDIITIVSOKDEHCWVGELNGI, 
RGWFPAKFVEVLDBRSKEYS IAGDDSVTBGVTDLVRGTLCPALK 
ALFEHGLKKPSLLGGACHPWIjP I EEAAGREVERDFASVYSRLVL 
CKTFRI^EDGKVLTPBBIXYRAVQSVNVTIH)^^ 
IC^GLNEQVUmWLEVK^LPTVEKWYQPWSFLRS 
ELRVLCCFAFSI^QDWErjPAKl^O^PLKEGVRDMLVKlIin^FS 
DVDG 


6348 


3 


3679 


agabkcfvtliacfiakqqnot 

kiaeqlkqaeei^qykvlvhsqerbltqlrekijiegrdasrsln 

bhu3aiiltpdepdksqgqdi^eqlaegcrlaqhlvqkl*spendn 

dddedvqvevaekvqkssspremqkabekevpedslbecaitcs 

nshgpcdsnq piiknt k i tfeedevns tl wdresshdecqdaln 

ilpvpgptssatnvsmwsagplsgekaaihii^ineklrpqla 

ekkqqfrnlkbkcfltqij\cflanqgn 

erqfkeekiaeqlkqaeelrqykvlvhsqereltqlreki^regr 

DASRSLNEHLQAlJjTPDBPDKiSOGQDIjQE^ 
LSPENDNDDDEBVOVEVAEKVQKSSAPPJBMPKAEEKEVPEDSLE 
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Amino acid segment containing signal peptide 
(A»Alanine f C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidine r I=Isoleucine, K= Lysine , 
L=Leucine. M^Methionine, N=Asparagine, 
P^Proline, Q=Glut amine , R=*Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECAITCSWSHGP YDSNQPHRKTKITFEEDKVDSTIjIGS SSHVEW 
EDAVHI IPENESDDEEEKE KGPVS PRNLQES EBEEVPQES WDEG 
YSTI^ IPPEMIJ^KS YSSTPHSLEEQQVCMAVDIGPJmWDQVK 
KEDHEATGpRLSREU^EKEPEVLQDSLDRCYSTPSGCIiELTDS 
CQPYRSAFYVIiEQQRVGLAVNMDE IBKYQEVBEDQDPS CPRLSR 
EIiLDEKEPEVLODSLGRCYSTPSG YIiKtiPDIiGOP Y5i<? RW^TiPft 
QYLGLALOVDRIRICDQBEEEDCKJPPCPRI^ 
SL»RCYSTPSSCIiEQPDSCOPyGSSFYAI.EEKHVGFS LDVGEIK 
KKGKGKKRRGRRSKKKRRRGRKEGEEDQNPPCPRLSRBLLDEKG 
PEVLQDSLDRCYSTPSGCLELTDS CQPYRSAFY I LEQQRVGLAV 
DMDE I EKYQEVEE DQDP S CPRLSGELLDEKEPEVLQES LDRCYS 
TPSGCLBLTDSCQPYRSAFYILEQQRVGLAVDMDEIEKyQEA^EE 
DQDPSCPRtiSRELIJJBKEPBVIiQDS LGRCYSTPSGYLBLPDLGQ 
PYSSAVYSraBQYMLAIJ>VDRIKKDQBB 

LEWBPBVI^DSLDRCYSTPSSCLEQPDSCQPYGSSKYAL.EEKH 
VGFS LDVGEI E KKGKGKKRRGRRS KKERRRGPJCEGEEDQNP PCP 
RIiNSMLMEVEEPEVLQDSLDICYSTPSMYFELPDSPQHYRSVFY 
SFKEEHISFALYVDNRFFTI»TVTSIiHI>VPQMGVI FPQ 








6349 


3 
• 


3679 


AGAKKCFVTLIACFLAKQQHKYRYEE CKDL I K3MLRNELQPKB E 

KLAEQLKQABELRQYKVLVHSQEREI»TQIiRBKI»RBGPJ^ 

I3ILQALLTPDEPDKS<X^DLQEQIJ^GCRIiA0HLV0K1jSPKNDN 

DDDEDVQVEVABKVQKSSS PREMQKAEEKEVPEDSLKECAITCS 

NSHGPCDSr^PHKtniKITFBEOEVNSTLVVDRESSHDECQDiALN 

ILPVPGPTSSATNVSMW S AGP LSGEKAAINf I LE I NEKLRPQLA 

BXKQQFRNUCEKCTLTQIACFLANQQNKYKYEECKDLI KFMLRN 

ERQFKEEKXAEQLKQAEELRQYKVLVHSQERELTQLREKLREGR 

DASRSLNEHLOALLTPDEPDKSQGQDLQEQLABGCRLAQHIiVQK 

I^PENDITODDBDVQVEVAEKVQKSSAPREMPKAEEKEVPBDSLE 

ECAITCSNSHGPYDSNQPHRKTKIOrFEEDKVDSTblGSSSHVEW 

EDAVH 1 1 PENESDDEEEEBKGPVSPRNLQES EEE EVPQES WDEG 

YSTLS I PPEMIASYKS YSSTFHSLEKQQVCMAVDIGRHRWDQVK 

KEDHKATG PRLSRELLDEKGPEVLQDS LDRCY STPSGCLELTDS 

CQP YRS AFYVI^QQRVGIAVNMDRI EKYQEVEEDQDPS CPRLS R 

EIJ^EKEPEVLQDSLGRCYSTPSGYLELPDr^3QPYSSAVYSLEE 

QYLGLALDVDR I KJO)QBBBBDQGPPCPRLSRELLEVVEPBVLQD 

SLDRCYSTPSSCJLBQPDSCQPYGSSPYALBBKHVGFSIiDVGBIE 

KKGKGK10UiGRRSKKERRRGRXBGBBDQNPPCPRIi5 RELLDBKG 

PEVLQDS LDRCTSTPSGCLBLTDSCQP YRSAFY I LEQQRVGLAV 

DMDEIBKYQEVEBDQDPSCPRIiSGBI,lU)KKKPEVLQESLDRCYS 

TPSGCI*ELTDSCQPYRSAFYII*EQQRVGIAVI)MDEIEKYQEVES 

DQDPSCTRLSP^LI^EKBPEVLQDSLGRCYSTPSGYLELPDLGQ 

PYSSAVYSLEEQYIiGIiALDVDRtKKDQBBBEDC^PPCPRIjSREI* 

LEWEPEVI^DSIJDRCYSTPSSCLEQPDSCQPYGSSFYALEEKH 

VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGKEDQNPPCP 

RLNSMl^EVEEPEVLQDSLDICYSTPSWYFELPDSFQHYR^V 

SFEEEHISFALYVmRFPTLTVTSLHLVFQMGVIFPQ 


6350 


3 


3679 


AGAEKCFVTLLACFIiAKQQNKTKYEECKDIiI KSMLRNBLQFKEE 
BXAEQLKQAEELRQYKVI>VHSQERELTQLREKIiREGRDASP^SIiH 
EHLQALLTPDEPDKSQGQDI^EQIAEGCRIAQHLVQKI>SPENDN 
DDDEDVQVEVAEKVQKSSSPREMQKAEEKEVPEDSIJBECAITCS 
NSHGPCDSNQPHKNI KI TFBEDEVNSTLVVDRESSHDECQDALN 
ILPVPGPTSSATNVSMVVSAGPLSGEKAAINILEINEKLRPQLA 
EKKCKJFRNLKEKCFLTQIjACFIANQQNKYKYRECKDLI KFMLRN 
FJIQFTCEEKLAEQLJCQAEELRQYH^VHSQERELTQLJ^EKLREGR 
nASRSL^EHLQAIXTPDEPDK^QGQDLQEQLAEGO^AQHLVQK 
LS P ENDNDDDED VQVEVAE KVQKS S APREMP KAEEKE VPEDSLE 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A^Alanine, C=Cy3teine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine # G=Glycine, 
H*Histidine, I=Isoleucine, K=Lysine, 
I>= Leu cine, M=Methionine, N^Aspa r agi ne , 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T-Threonine, V- Valine, 
W=rTryptophan, Y=Tyrosine, X^UnJcnown, *-Stop 
Codon, /^possible nucleotide deletion, 
\<=poasible nucleotide insertion) 








BCAITCSNSHGPYDSKQPHRKTKITFBEDKVDSTIiIGSSSHVEW 
EDAVH 1 1 PENESDDEEEBBKGPVSPRNLQES EEREVPQESWDBG 
I STL IjS lrPKWbAbl Kb 1 biA rHbljBfi^VCHlW UlGRnKWDUV K. 
KEDHEATGPRLSRE LkDEKGPBVLQDS LDRCYSTPSGCLELTDS 
CQPYRSAPYVI^aRVGLAVl^EIEKYQBVEEXJQDPSCTRIiSR 
ELLDBKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSIiEE 
QYLGIiAlJ)VDRI KKDQEEEBIWPPCPR^ 
SbDRCYSTPSSCLBQPDSCX)PYGSSFYALKEKHVGFSriDVGBIE 
K KG KGKKRRGR RS KKKKRRGR K EGR KDQN P P CPRLSR ELLDE KG 
PEVLQDSLDRCYSTPSGCbELTDSCQPYRSAFYILEQQUVGtAV 
DMDEIEKYQEVBEX>QDPSCPRLSGELLDEKEPEVLQESLDRCYS 
TPSGCLELTDSCQP YRS APY I LEQQRVGIAVDMDE I E KYQEVEE 
DQDPSCPRLSREIJLDEKEPEVLQDSI/3RCYSTPSGYIiELPDLGQ 
P YSSAVYSLBEQYGGIiALDVDR I KKDQBEEKDQGPPCPRIjSRKIi 
IiEWE PBVLQDSLDRCYSTPS S CUEQPDSCQP YGS S FYAIjEE KH 








RI^SWLMEVBEPEVIiQDSLDICYSTPSMYFELPDSFQHYRSVFY 
S FEB EH I S FALYVD^FFTLTVTSLHLVFQMGVIFPO 


6351 


17.91 


319 "■" 


REAKJ^TER2Qr^RMrjVVEV7^GRSLVWGARAVQALRERIiGVC4<:3 
RTVGALPRGPRQNSR^LPLGI^PBBARLIAEIGAVTIjVSAPRP 
DSRHHSLiALTSFK^QQEBSFQEQSALAAEARBTRRQBIiLEKITB 

gqaakkg^cleqasgasssqeagssqaakeijetsdgqa^ 
gpsssqagpsngvaplprsalijvqiatarprpvkarpiidwrvqs 
kdyiphagrpahelrys i yrdl.w ergfflsaagkfggdfiivypgd 

PLRFHAHY1AQCWAPBDTI PI£DLVAAGRI/3TS VRKTLLLCS PQ 
PDGKWYTSLQWASLQ 


6352 


235 


923 


WSBWI^PCHAAKCKGI^MLRITMKTRAISIAADAT^ 

AMAR S LVHiyWFYCljS VYQVK IS PTPQLGAAS SAEGHVG QGAPG 

LMGNMNPEGGVNHENGMNRIXX3^ 

PAQAAMEGPQPBNMQPRTRRTKFTI*LQVEELES VFRHTQY PDVP 

TRRELAElTLGVTEIDKVRVWFKNKPJ^CPJi^ 

DDCVYIWD 


6353 






RFAGAGAI PEARAR PPDVQAAEEEKEMDLPDS AS R VFCGR I I>S M 
VNTDDVNAI ILAQKNMLDRFEKTNBMIJtNFNNIjSSARIiQQMSER 
FIifflTRTLVEMKRDIJ)6 1 FRRI R^n^^KLARQHPBAFSH I PEAS 
FLBBEDEDPIPPSTTTTIATSEC^TGSCDTSP1)TVSPSIjSPGFE 
DLSHVQPGSPAIKGRSQTDDEEMTGE 


6354 


965 


510 


PSIiRPMEPTRDCPLFGGAFSAILPMGAIDVSDLRPVPDNQEVFC 
HPVTDQSLIVEI»LEI^AHVRGBAAARYHFBIWPC?VQG 
SVQPLSLENLAI»RGRCQEAWVLSGKQQ 
IiIiRLPQYQTDLLLTFNQPP 


63*5 


158 


1662 


RG3SAAFRGSGLRGAMIRRVLPHG>»RGIiIiTRRPGTRRGGFSLD 
WDGKVSE IKKKIKS ILPGRS CDIiLQDTSHLPPEHSDVVXVGGGV 
T,f!r JTOAYWT.KTTTiES RRGATR VT.WRRDHTYSQASTGLS VGGI CO 
QFSLPF^IQLSLFSASFLRNINF^n^VVBAPPLDLRFNPSGYLL 
LASEKDAAAMESNVKVQRQEGMCVSL^ 

ALASYGMEDEGWFDPWCLLQGLRRJCVQSI^ s 

SQRMLTTDDKAVVLKRXHBVHVKMDRSI^YQPVB 

WSAQIAALAGVGEGPPGTIiQGTKLPVBPRKRYVYVWHCPQGPGL 

ETPGVADTSGAYFRREGrjGSNYLGGRS PTEQEEPDPANLEVDHD 

FFQPKVWPHIiAIiRVPAFBTIJ^ 

PLV\TOMYFATGFSOTGU»APGIGRAVAB 

EFTRFYLGE KI QENNI I 


6356 


354 


633 


TGLTSSCLPLQVMMTKRTKDMGKFSSVTVSTIDEBKEEXEAREV 
ADS YAQNAKVI EKQLERKGMS KRRIjQBIiAELEAKKAKMKGTLID 
NQFK 
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beginning 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^Aepartic Acid, B= 
Glutamic Acid, F-Phenylalanine, G^Glycine, 
H=*Histidine, I»Isoleucine, K= Lysine, 
L= Leucine , M=Methionine, N»Asparagine , 
P= Proline, Q=Glut amine, R^Arginine, 
S=Serine, T=Ttoreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown # *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 


€357 


2 


915 


GLLRNMALLVRVT jRNQTS I SQVTVP VCSRL I PVSPTQGQGDRALS 

RTS QWPQMSQS QACGG 3EQIPGI D I QLiniKyHTTRKLSTTKDS P 

QPVBBKVGAFTK1 IKAMGFTGPLKYS KHKJK IAALRMYTS CVEK 

TDFEEFFLRCQMPOTFNSWFIiITTjLHW 

CRI IVHFMWKDVQQRGRVMGWPYIIJuXNMIIJfrN^ 

DKG ILSDDKGIiAAALWRTPPNRKGEDPRHLELLVBYVRKQIQyL 

DS MNGEDLLLTGEVS WRPLVE KNPQS ILKPHSPTYNDEGL 


6358 


2009 


1040 


AS DALHSL5 AP VLRLS S RS AARPATMTEQAI S FAKD FLAGG I AA 
AISKTAVAPIERVKLLIjQVQHASKQIAADKOYKGIVDCIVRIPK 
EO^JVLSFWRGNLANVIRYFPTOAIjNFAFKDKYKQI flggvdkht 
QFWRVTAGNLASGGAAGATSLCFVYPl^PARTRLAADV^ 
REFRGLGDCLVKITKSDGIRG1.YQGFSVSVQGI IIYRAAYFGVY 
DTAKGMLPDPKNTHI WS HMIAO/nrTAVAGVVS YPPD*rVRRRTW 
MQSGRKGADIMYTGTVDCWRKI PRDEGGKAFTKGAWSNVLROMG 
GAFVLVLYDBLKXVI 


6359 


98 


1086 


VCRQEKEKMKBDCLPSSHVpISr^KSIQKSBIMLliKTYNCYHB 
GKSFQLRHRKEBGTLI I EGLIN IAWGLRRPI RLQMQDDREQVHL 
PSTSimPI®PSCPOCBPSPQNGKITAQGPSIQPVHKABSSTDSS 
GPLEKAEKAPQIjHRTKSDASCMSQRRPKCRAPGKAQR IRRHRFS 
INGHFYNHKTSVFTPAYGSVTNTOVNSTl^rTL^ 
EIX3PSEFALYIVHKSGBRTKLKDCBYPLISRILHGPCBKIARIF 
LMEADLGVEVPHEVAQYI KFEMP VLDSFVBKLKEEEERE I IKLT 
MKFQALRLOWLORLEQLVEAK 


6360 


1 


345 


GTRGAVPSTLEKWL.PPRSCRVPWIHSGT'I'MSKVSFKITIiTSDP 
RLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAI ITNDGIGIN 
PAQTAGNVFLKHGSKLRI I PRDRVGSC 


6361 


615 


158 


RPGI^QIX3HCAI^QAGNRRCRPHGRl^Ar.TRSrHRGKPMS IMQ 
FKDTLNTPLPDSS P VAVPLGAP IAVASTLSVEHNDGVETGI WAC 
APGRWRRQITSQBFCHPIO^ROTFTPDIXJETLHIQAGTIALMLPA 
NSTG I WDIQBTVRKTYVIi 1 1» 


6362 


350 


1576 


TTMDGSHSAALKLQQLPPTSSSSAVSKASFSYKENLIGALLAI P 
GHLWSIALKLQKYCHIRLAGSKDPRAV FK'i'K'l'WWLGLPLMLLG 
ELG VFAS YAFAPLSL rVPLSAVSVIASAI IGI IPIXEKWKPKDF 
LRRYVLSFVGCGIJttAJGTYTiLVTFAPNSHEKM^ 
PFLLYMLVBI II^CXLLYPYKBiaiAtWlVVXIiIiVAI^ 
TVKAYAGMLVLS IQGNLQLD YP IFYVMPVCMVATAVYQAAFLSQ 
A3QMYDS3LIASVGYIL3TTIAITAGAIPYLDFIGEDVLHICMF 
ALGCXIAFT/3VTLITRMRKIC? I PFEP YISMDAMPGMQ$IMHDKGM 
TVQPELKASFS YGALENNDN I S EI YAP ATLFVMQEBHGS RS ASG 
VPYRVLEHTKKK 


6363 


21 


1201 


RRTPJ^SSFPRRRDSSAMESYDVIANQPVVIDNGSGVIKAGFAG 
DQIPKYCPPNYVGTIPKHVRVMAGALEGDIPTGPKABKHRGLLS I 
R YPMBHG IVKD WNDMER I WQ YVYSKDQLQTFSEEHPVLLTEAPL 
NFKiU^KJiKAAISV Jr FJSTrW VFAUr JLoWs^VijoljXAiTjrKl XliV Via? 
SGDGVTHAVPr^EGFAMPHSIMPJI^IAGRDVSRFl^YLR^Y 
DFHSS SEFEIVKAI KERACYItS INPQKDETLBTBKAQYYL.PDGS 
TIEIGPSRPRAPEWiFRPDLIGEKSEGIHEVLVPAIQKSDMDLR 
RTLFSNIVLSGGSTLP^GFGDRIXSEVKKLAPKDVKIRISAP^ 
RIiYSTWIGGS ILASLDTFKKMWVSKKE YEEDGARS IHRKTP 


6364 


21 


1201 


RRTRLGSS F PRRRDS SAME S YD VI ANQ PWI DNGSGVT KAGFAG 
DQ I PKYCF PNYVGRP KHVR VMAGALEGD IFIGPKAEBHRGLI jS I 
RYPMKHGIVKDWITOMERIWQYVYSKDQIjQTFSEEHPVLLTEAPT. 
NPRKNRERAAEVFFETFNVPAIiFI SMQAVLS LYATGRTTGWLD 
SGDGVTHAVPI YEGFAMPHS IMRIDIAGRDVSRFLRLYLRKEGY 
DFHSSSEFEIVKAIKERACTI^lOTQKDETl^^ 
TISIGP$RFRAPELL?RPDLIGEESEGIHEVLVFAIQK£DMDLR 
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amino acid 
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Predicted end 
nucleotide 
location 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=*Alanine , OCysteine, 0=Aspartic Acid, E~ 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=*Histxdine, I=Isoleucine, K=sLysine, 
L=I*eucine, M^Methionine , N=Asparagine , 
P=Proline, Q^Glut amine, R J =Arginine, 
S^Serine, T=Threonine, VaValine, 
W^Tryptophan, Y^Tyroeine, X^Onknown, *-Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








RTLFSNIVLSt^STLPKGFGDRIJiSEVKKLAJPKDVKIRI SAPQE 
RLYSTWIGGS II^nDTFKKMWVSKKEYEEDGARSIHRKTP 


6365 


234 


19B9 


KHKSRASCAAHAOAKGPSRERJE^mSRFUSGIJiRIiGB^SGCC™ 
ASMGTLAFDBYGRPFL 1 1 KDQDPJCSRI/^LEALXSHiyAAKAVA 
NTHRTS IiGPNGLDKKMVDKIX3ITVTVT^^X^TIIl3MMDVDHQIAK 
LMVBI^KSQDDBIGDGTraVVVIiAGJu^riBBA^ 
ADGYBQAARVAI BHLDKI SDSVLVDIKDTEPL I QTAKTTLGSKV 
WSCmQMAEIAVNAVLTVTVDMBRRDVDFELIKVEGKVGGRLED 
TKLIKGVIVDKDFSHPC^PKKVEDAXIAII.TCPPEPPKPKTK^ 
LDVTSVEDYKALQKYKKEKFEBMIQQI KHIGANIAIOQWGFDDE 
ANILLLLQNNL PAVRW VGG PE I ELI A IATGGRIVPRPS EI/TAE KL 
GPAGLVQBISFGTTKDKMLVXEQCKNSRAVTIFIRGGNKMIIEE 
AKPJS LHDALCVT RNL IRDNR WYGGGAAE IS CALAV5QBADKCP 
TLBQYAMRAPADALEVI PMAI>SENSGMNPIQTMT*EVRARQVKEM 
l^AI^riK^KGTNDMKQQm^IETLlGKKQQISIiAlXjMVRMILK 
1DDIRKPGESEE 


6366 


257 


1898 


GNKisGAHSSTFW VLLS I FIjGAVAMIiCKEQGITVLGLNAVFliilfjV 
IGKFIWLEIVQKVLHKDKSLENIXSMI^^ 

^YVRWRIMGTGPPAPTEVDNPAS FADSMLVRAVNYNYYYSIiNA 
WLIXCPWWLCFDWSMGCIPLIKSISDWRV1ALAALWFCLIGLIC 
QAIiCSEDGHKRR I LTLGLGFIATI P FIjPASKLFFRVGFWAERVTj 
YLPSVGYC^IiTFaFGALSmTKJaaOjIAAVVI^ILFINTIJRCV 
LRS GEWRSEEQIjFRS AI>S VCP1JNAKVH YNIGKNLADKGNQTAAI 
RYY RFAVRLNPKY VHAMNNLGNILK£R1TOLQEAEELIiS LAVQIQ 
PDFAAAWMNLGIVQ^LKRFEAAEQSYRTAIKHRRKYPDCYYNL 
GRLYADl^RH VDALNAWRNATVLKPEI ISIAWNNMI I LLDNTGNL 
AQAEAVGREALEL I PNDHSU4FSLANVLGKSQKYKESEALFLKA 
IKANPNAASYHG6T[AWjYHRWGHLDLAXKHYEISLQIiDPT71SGT 
KBNYGLLRRKLBIMQKKAV 


6367 


287 


1934 


SIGFPVMLVLSILLYTCBMFODSVAFKDVAVSFTQKKWALLDPS 

QKNLyRrrVMQBTFlCNLTSVGKTWKVQHIEDE 

KIXESKESHHCGBSFroiMJDMIiNRKra 

GHSSIjrmiRAD'IX;HKSSP^OKYGFOTYRNKKCKKAFSYLDSFQ 

SHDKAGTKEKPYDGKECTETFISHSCIQRHRVMHSGDGPYKCKF 

CGK7tf?YFLNLCI*IHERIHTGVKPYKCKQC&^ 

THTGVNADBC^CGNAFSFPSEIRRHKRSHTGBKPYBCKQGGKV 

PISFS S IQYUKMTIFrGBKPYBCKQ(X3KAFRCGSHI^]aiGRTHTC 

EKPYECR(KX3KAFRCTSDLQRREKTIiTKDKPYGCKQCGKGFRCA 

SQLQIHERTHSGEKPHECKEOGKVFKYFSSIJttHERTirrGEKPH 

ECKQCGKAFRYFSSIiHTEIFJlTE^DKPYECKVCEKAFTCSSSIR 

YHERTHTGRKPYECKHCGKAFISNYIRYHERTHTGEKPYQCKQC 

GKAFIRASSCREHERTHTINR 


636B 


1 


327 


RPVPAKLNPRSWPRTAGALPLRPPPLTMAVFIIDEVEIEDPQYDE 
BQFVCGETVPAPSANKELVKC 


6369 


1 


1745 


AGCCRDTRFPTPRGPGS LCHNFCRSAACTVTRT IHG S PREDTGT 

PRSREMMFQDSVAFEDVAVSFTQEEWALLDPSQKNLYRDVMQET 

FKNt/TSVGKTWKVQNIBDEYKNPRJ*NI>SL^^ 

ESFNQIADDMLNRKTLPGXTPCESSVCGEVGTGIISSLNTHXRAD 

TGHKSSEYQEYGBNPYRNKECKKAPSYLDSFQSHDKACTKEKPY 

DGKECTBTFISHSCIQRHRVMHSGDGPYKCKFCGKAFYFimCl, 

IHERIHTGVKPYKCKQCXSKAFTRSrTLPVHERTHTGVNADEaCE 

CGNAFSFPSEIRRHKRSHTGEKPYECaCQCGKVFISFSSIQYHKM 

THTGEKPYECKQCGKAFRCESHLQKHGKTH'iGEKPYECRQCGKA 

FRCTS DLQRHEKTHTEDKPYGCKQCGKG FRCASQLQIHERTHSG 

EKPHECKEOGKVFKyFSSLRIHERTHTGBKPHBCKQCGKAFRYF 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, C=Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, I«Isoleucine, K=Lysine , 
L=Leucine, M=Methionine, N=^Asparagine , 
P= Proline, Q=Glut amine, R^Arginine, 
SaSerine, T^Threonine, V»Valine, 
W=Tryptophan, Y-Tyrosine, X=Untaiown, *=3top 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SSLHIHERTHXGDKPYECKVCGKAPTCSSSIRYHERTHTGBKPY 
BCKHCX5KAFISNYIRYHBRTHTGEKPYQCKQCGKAPIRASSCRE 
ItERTHTINR 


6370 


1711 


329 , 


FVLS EQRLRTERTW PRS PGLGRGAAAAGARTAGAGLLRLLiLGCG 
ALVQQLRPVl>ITTPANA(aiAS KTWBLSLYEIjHRTPQKAI MDGTE 
lAVSPR^IJfSEl^CPICLDKIiKi™ 

S GWKBCPTCRKfCLVS KRSLRPDPNFDALISK I YPSKKE YKAHQD 

RVLIRI>SR1>HNQ0AI>SSSIEE^IJWAMHRAQRVRRPIPGSD^ 

TTMSGGBGE PGEGEGDGEDVSSDSAPDSAPGPAPKRPRGGGAGG 

SSVGTGGGGTGGVGGGAGS EI>SGDRGGTXGGGTIjGPPS P PG APS 

PPEPGOTIELVFl^HPIiVEKGEYCOTRYV^ 

I ALR I ALERRQQQEAGE PGG PGGG AS DTGG PDGCGG EGGGAGGG 

IX3PEEPALPSLEGVSKKXJYTIYIAIH5GGAm^NGSIjTIJeLVNB 

KFWKVSRPLKLCYAPTKDPK 


6371 


3 


2B8 


GVANMSTAMNFGTKSPQPRPPDKGSFPLDKLGHCKSPKEKVMKC 

LHMNNFENAI*C!RKBSKBYLECRMERXLMI^ 

KSRAXK 


6372 


2l4l 


625 


RVSAIASBGKAEl^YKKTiKDIJ.KKSFSLVKMPSLQPVVMCVMECH 
LPKVP EKKXKLVMAD KELYRACAVEVRRQ I WQDNQALFGDKV S V 
LLKQYILKKESAIjFSTKLSVLiHNFFS PSPKTRRQGBWQRLTRM 
VGKNVKLYDWVLQFLRTLFXiRTRKWY 

EI CTVDPCHKFTVilCLIlAC I RRRFVDS KRAMLQGFLiDGVKKGQE 
QVLGDLSMILCDPFAIlfrLAIiSTVPJ^ 

U^RUJ^QGAWDKIDSQWKEPKMKVELITRFLPMLMSFLVD 
DYT FN VDQKLP AEEKAPVS YPNTLPES FTKFLQEQRMACEVGLY 
YVLHITKQRNKNAU>RLtiPGLVETFGDlAFGDI FLHLLTGNIiAL 
LADEPALBDFCS SLFDGFFI.TAS PRKENVHRHALRI>IjIHLHPRV 
APS KLEAIjQKALEPTGQ SGEAVKEI. YSQI^GB KIiEQLDHRKPS PA 
QAAKTPALKLPLPSVPAPAPIi 


6373 


67 


711 


PS RAARAS PARL3AMVS W I 1 SRL WL I FGTLYP AYYS YKAVKS K 
DIKEYVKWMMYWIIFALFTTABTFTDIFIjCWFT^^ 
WLLSPYTKGSSLLYRKFVHPTIjSSKEKE IDDCLVQAKDRS YDAIt 
VHFGKRGLNVAATAAVMAAS KGQGALS ERLRS FSMQDI/TT IRGD 

gapapsgppppgsgrasgxhgqpkhsrsasbsasssgta 


6374 


S35 


2105 


R^FCSYXSTSEFPSSTRR^SCFIirrFCigYTSSTIFL^TRDHS 
CPTHTFCNYTSSTI EliS^TRlJHSCFnrrSCCIY'rSSTIFLSSTRD 
HSCPTHTSCNYTSSTIFI>SSTRI^CPTHTFCNYPRPIIRLSSC 
CPAHTJ?TEGZm}QUNI£GFQVVl£DTVLPPEGGGQPnDIlCT IN 

dxsvi^vtrp^eojujhftoppldpgsqvlvrvdwerrfijhnqq 

SGQKLI TAVADHLFKLKTTS WEIjGRFRS Al EIiDTPSMTAEQVAA 
XBQSVNBKXRl>RItPVNVRBliSIjDDPEVEQVSGRGIiPDDHAGP IR 
WNIBGVDSlMCrcTHVSNIiSDIiQVlKrLG^ 
SGITCWTjKNMERSHGTEKALTAIJLKC^ 

ANEIGSE^LLFLWGDBKGGGLFLIiAGPPASVETIjGPRVARVL 
EGXGAGKlCGRPOGKAlT<MSRRMBAOAr.LQDYISTQSAK3 


6375 


1 


1535 


aimaaatrpvrlpeagcegrercwnpsrsrshsgegglaawsrt 
cpgrppjtpggovvrgpt^vtayia^ 
ppgracsnpsflrfqi^fyqvyfnalaadwlqapyl 
flegqraxlyvcgiastvlfgiivassdvdwlgrkns cvlfslt y 
suxxtki»sqi^fvi^vgrai^i^tai^fsafrawyihehver 
hd fpaewi pat faraafwnhvlavvagvaafavas w igi^pvap 

FVAAIPLLALAGAIJCiRNiraENY^ 

RVLIXGTIQALFESVrPIFVPl.WTPVIiDPHGAPrXJIIFSSFMAA 
SIJjGSSLYRIATSKRYHI^PMHLLSI^VLIVVFSLFMLTFSTSP 
(KJESPVESFIAPI^IEJ^CGLYFPSMSFI^KAaPETF^AGVI^ 
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Amino acid segment containing signal peptide 
(A^Alanine , C-Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histid±ne, I=Isoleucine, K=Lyaine, 
1^ Leucine, M=Methionine, NaAsparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=. Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X-Uixknown , *-3top 
Codon, /^possible nucleotide deletion, 
\t=possible nucleotide insertion) 








WFRVPLHSIACLGI^VIOTSDRKTGTRWSICSAVMVMAI^ 
VGLPTWRHDAELRVPS PTKEPYAPBL 


6376 


380 


1437 


isstdidhyrpsflvnskmpskbswsgrktnraavhkskqbgrq 
qdlliaalgmklgspkssvtiwqplklfaysqltslvrratlkb 
nkqi pkyekihnfkvhtb^gphwm ycanfmwgliaqgvkcadc 
glnvhkq cs k^wpndckpd^jkhvkkvysc3>lt^livk7ahttkrpm 

VVmCIRBlBSRGI^SEGLYRVSGFSDlirEDVKMAFDRDGEKAD 
ISVNMYEDINIITGALKLYFIU)LPIPLITYT?AYPKPXKSAKIMD 
PDEQLETLHBALKLLPPAHCKTI^ 

ENLGXVFGPTLMRS PELDAMAALIIDIRYQRI*VVBI>LX KNEDILF 


6377 


2311 


1844 


SRIRRRSSRRPREPPGPSRRRRRRRPDPRTMPSEKTFKQRRTFE 
QRVBDVRLIREQHPTKIPVI IERYXG3BKQI.PVLDKTKFLVPDHV 
HMSELIKI X RRR JbQJbNANQAFFLLVNGHSMVS VSTP I SEVYBS E 
KDRDGFL YMVYAS QETFGMKLS V 


6378 


686 


191 


GAGPWEAFPDGIGRRSRRARbPQYKRPPGRVGGGDSGRRNMAVA 
DLALIPDVDIDSDGVFKYVLIRVHSAPRSGAPAAKSKEIVRGYK 
WABYIIADI YDKVSGDMQKQGCDCECLGGGH I SHQSQDKKIHVYG 
YSMAYGPAQHAISTBKIKAKYPDYEVTWANDGY 


6379 


3* 


378 


ERAGSPEPSRAALRRCAPQRSQAPRWPDRAACRRSFOGSQGRAY 
LFNSVVNVGCGPABERVLLTGIiHAVADIYCEHCKTTI^ 
FESSQKYKEGKYI IBIiAHMIKDNGWD 


6380 


1414 


462 


PAVQGO^GAGPP"XXJRGSGWMAR;FALTAWRHGETRENKEKI IQGQ 
GVDBPLSETGFKQAAAAGIPI^IHVXPTHAFSSDL^TKQTMHGI 
LFJRSKFCKDMTVKYDSRLRERKYGVVEGKAIiSEIJiAMAKAAREE 
CPWTPPGGETIJX^KMRGrDFTEFIiOQLIIjKEADQKEQFSO<3S 
PSNCI>ETSlAEIF^U3KmiSSKVNSDSGIPGIiAASVl,VVSHGAY 
MRSLFDYFLTDLKCSLPATLSRSELMSVTPNTGMSLFI IWFEEG 
REVKPTVQC I CMNIiQDHLNGIjTENS IjGWHjPS KSNHFEPLKGVP 
LALFTSLXiC 


6381 


1668 


218 


AVVRAQGSRGFSGLAGWRPRQAAAMNFSEVFKI#SSLLCKFSPDGK 
¥W«CT/gYJ^VVRDVNTI£riX^Y^ 

AMYKRGtiVQ VWS LEQ PEWHCKIDEGSAGLVAS CWSPDGRHI LNT 
TBFHI^ITVWSLCFK^SYIKYPKACI^ 

RDCKDYVS I FVCSDWQLLRHFDTI>TQDLTGlBWAPNGCVIiAVWD 
TCIjEYKILI,YSI^RI^TYSAYEWSLGIKSVAWSPSSQFI»AVG 

s ydgkvr x lnhvtwkm i tbfghpaaznd pki wykraeks pqlg 
lgcls fpppragagplpsskskyeias wvsi^itjcpvtdranp 
kigigmlafs pds yfiatrndnipnavwvwdiqklrlfavleql 

S PVRAFQWDPQQ PRLA I CTGGSRTj YLWS PAGCMS VQVPGEGDFA 


6382 


2 


1062 


PBBDBDRNIX^L I AYPIiKGI>HG ITVDI VDNSDCE PKS KLLRWTTNK 
KBHVLETBKTPKDWVRQHRIOSEKMKS^^ 
TVEKXGNISSQLBGKYNPWSMKCHCKX^RMKF^AKHRNQYKF 
LBlttTSRYEWCVLDLKMGTRQI^^ 

AVIGVR VCGMQVYQAGSGQLM FM^YHGRKLS VQGFKEALFQFF 
HNGRYI,RREIiGPVLKKLTBLKAVLBRQESYRFYSSSLL^ 
KERPEVVIiDSDAE DLEDLS EESADBSAGAYAYKP I GASS VDVRM 
IDFAHTTCRLYGEDTVVHKGQDAGYIFGLQSLIDIVTErSEBSG 
B 


6383 


3159 


1061 


S PAPGR P S P HGSQPAARAAAAPAMPSAKQRGS KGGHGAASPS EK 
GAHPSAARPLAAPTPAAPACRS PS PGGAPAS FPGRAPRS LASQP 
AARAAAAPAMPSAKORGSKGGHGAASPS E KGAHPSGGADDVAKK 
P PPAPQQ PPP PPAPH PQQHPQQHPQNQAHGKGGHRGGGGGGGKS 
SSSSSASAAAAAAAAS3SASC5RRLGRALNFLFYLALVAAAAFS 
GWCVHHVLEEVQQVRJISHQIJFSRQREEIXK^IOGVBQKVQSLOA 
TFGTFESILRSSQHKQDLTEKAWQGESEVSRISEVLQKLQNEI 
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Amino acid segment containing signal peptide 
(ARAlanine, C=Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, F= Phenyl alanine , G=Glycine f 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S~Serine, T«Threonine, V«=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








LKDLSDGIHWKDARSRDFTSLENTVKERLTKLTKS INDNI AIF 
TEVQKRSQKK INDMKAKVASLE ESBGNKQDLKALKEAVKE I QTS 
AKS REWDMEALRS TLQTMES D I YTBVRBLVS WCQBQQAPKBAAD 
TKRLALQAIiTEKLLRSEESVSRLPEBIHRLBEKI^QIiKSDSHGP 
I^DGGPIU^SEAFKALQOKSQGLDSRLQHVBDGVrjSMQVASARQT 
ESLESLLS ICSQBHBQRIiAALOGRIiBGIiGSSEADQDGLASTVRSI* 
GETQLVLYGDVBBIiKRSVGKLPSTVESLQKVQEQVHTLLSQDQA 
QAARLPPQDPLDRLS SLDNLKAS VSQVEADIiKMliRTAVDSLVAY 
SVKIETNKNNLKSAKGMiDDIJRNDLDRLFVKVEKIHEKV 


6384 


738 


1904 


I WEVPVCLTHLLHLQOANQPLPP PSSS INEBDADEANRAIGEKR 
AAPDSGKKPKTPKTKOX?KDPNE PQKPVSAYAIiFFRDTQAAI KGQ 
HPNATFGBVSQIVASMWI^LGEEQKQVYKRKTEAAKKBYLKAIA 
AYRAStiVSKAAAESABAOTIRSVQQTI^ 

HGTVSASPQTI^SLPRSIAPKPLTM^.PMNQIVTSVTIAAW4P 
SNIGAPL1 SSMGTTMVGSAPSTOTSPSVQTQQHO/1QLQQQQQQQ 
O^MQaMQXX2QIiQQHQMHQQ^ 

LQQQINQC^LGXXJIW^UKJI'QHMQHQSQPSPRQHS PVASQI 
TSPIPAIGSPQPASQOHQSQIQSQTQTQVLSQVSIF 


6385 


2 


1584 


PRVRAADVAAGAQAVVSAGMAKSNGENGPRAPAAGRSLSGTRES 
IAQGPDAATTDBL3SLG5DSEANGPABRRIDKFGFIVGSQGAEG 
AIiBEVPLBVIiRQRBSRWUXlLilQIHDK^ P 
PSLRGPJ\WQY1jSGGKVKLQQNPGKFX)BIJDMS PGDP KWLDVI ERD 
LHRQF PFHKMFVS RGGHGQQDLFRVLiKAYTL YRPKEGY CQAQAP 
IAAVLLMIIMPAKQAFWCIjVQICEKYLPGYYSEKLEAIQLDGEIL 
FSLLQKVSPVAHKHI^ROJCIDPUiYMTEWFMCAFSRTLPWSSVL 
RWDMFTCEGWCI IFRVGLVLLKHAI/5S PEKVKACQGQYETIKR 
UlSLS PKIMQEAFLVQEVVELPVTERQIEREHLIQIiRRWQETRG 
BI^CP^PPRiaOAKAlIJ>AKPGPRPAI^2PSPSIRI*PIjDAPLPGS 
KAKP KPPKOAQ^QRKQMKGPUKJLEKPPAPNOAMVVAAAGDACP 
PQRWPKDSAPKDSAPQmiAPO^SAHTOSQESLTSQESEDTYL 


6386 


819 


195 


"TVC«SFYIX3IMQRASRLKP^UIMIATEPPPGI^0DKI)QMDDr. 
RAQILGGANTPYBKGVFKLEVEIPBRYPPBPPQIRPLTPIYHPN 
IDSAGRICIJ^KLPPKGAJmPSI^IATVLTSIQIiLMSEPNPDD 
PLMAD I SSEFKYN KPAFLKNARQVnTEKHARQKQKADEEBMXJDNL 
PEAGD5RVIINSTQKRKASQLVGI BKKFHPDV 


6387 


1 


662 


PGPTHASAnAWADAWAQPNMAMHNKAAPPQI PDTRRELAELVKR 
KQEliAETLANLKRQr!fAFEGS YI^DTOMYG^ IRGWDRYLTNQK 
NSNSKiroRP^RKFKKAERLPSKSSVTSAAAVSAIJu3VQDQI>IRK 
REPGSGTESDTSPDPHNQENBPSQEDPEDUXSSVQGVKPQKAAS 
STSSGSHHSSHKKPJCNKNRHSPSGMPDYDPEIDI*KLNKKPPJU7Y 


6388 


1 


662 


PGPTHASADAWADAWAQPNMAMHNKAAPPQI PI/rRRELAKLVKR 

K^IiAETLANLBRQIYAFBGSYliBOTO^IYG^ 

NSNSKNDRRNRKFKEAERLFSKSSVTSAAAVSALAGVQDQtIEK 

RBPGSGTESDTSPDFHNQKNEPSQEDPEDI^SVQGVKPQKAAS 

STSSGSIIHSSHKKRKNKNPJISPSGMFDYDpBIDLKIiNKKPRADY 


6389 


1074 


497 


ABPGDRMAGHRLVLVLGDLHI PHRCNSLPAKFKKiLVPGKIOHI 
LCTGNI^nTCESYDYLKTLAGDVH IWGDFX)ENLNY PEQKVVTVG 
QFKIGLIHGIIQVI PWGDMASLAT*I/QRQFDVDI I j I SGHTHKFEAF 
EHENKFY I NPGSATGAYNALETNI IPSFVTaMDIQASTVVTYVYQ 
IiIGDDVKVERIEYKKP 


6390 


158 


535 


GEERKBGRAPGKAFAPERNPAKMEKEETTREIiLIiPNWQGSGSHG 
LTIAQRDDGVFVOjBVTO^SPAARTGVVKEGDQrVGATIYFDNI^ 
SGRVTQLLNTMGmrrVGLKLHRK^ 


6391 


5386 


2697 


WWNSKTECYI^IQTQENFPANLNELVWCIVI^vxIVkiu.Xa 
HSLLGS RNQLARAVLNPNPMDFCTKDIiLTTTSER I IAYLRDFNE 
DQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL 
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Predicted end 
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Amino acid segment containing signal peptide^ 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E*= 
Glutamic Acid, Phenylalanine, G=Glycine, 
H«Hi3tidine, I=Isoleucine, K=Lysine, 
L=I>eucine, M=Methionine, N=Asparagine , 
P=Pxoline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W-Tryptophan, Y^Tyrosine, X ^Unknown, *~Stop 
Codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion) 








LTKNQRKQHSDENSNAKIKQNRVLVCAPSNAAVDELMKKI II.EF 
KEKCKDKKNPLGNCGDINLVRLGPBKSINSEVLKFSLDSQVNHR 
MKKELPSHVQAMHKRKEFLDYQLDELSRQRALCRGGREIQRQEL 
DKNISKVSKKRQKLASKIKEVQGRPQK^SIIILESHIICCTLS 
TSGGIJ#I*BSAFRGO<5GVPrecVIVDBAGQSCBIETLTPLIHRCN 
KJ>I LVGDPKQLPPTVlSMKAQEYGYBQSMMARFCRIiLBBNVBHN 
MISRLPII^LWQYRMHPDICLFPSNYVYNRNLKTNRQTBAIRC 
S S DWP FQ P YLVFDVGDGSERRDNDSY INVQE I KLVME 1 1 KLI KD 
KRKDV3FRNIGIITHYKAQKTMIQKDIiDKBPT>RKGPAEVDTVDA 
FQGRQKDC^IVTCVRANSIQGS IG FLASLQRIjNVTI TRAKYSL F 
I LGHLRTLKENQHWNQIi IQDAQKRGAI I KTCDKNYRHDAVKI I*K 
LKPVIjQRSLTHPPTIAPEGSRPQGGLPSSBCLDSGPAKTSVAASL 
YHTPSDS KE I TLTVTS KDPEIRP PVHDQIiQDPRLIjKRMG I BVKGG 
I FLWD PQ PSS PQHPGATP PTGE PGFPVVHQDLSHVQQPAAVVAA 
I»S SHKPPVRGEPPAAS PEASTCQSKCDDPEBBIiGHRRKARAPSE 
GEQEKCGSETHHTRRNSRWDKRTLEQKDSSSKKRKrj, 


6392 


972 


186 


GRTGVDIASSMAHRIXJIRIiLTVfDVKDTIJaRI^HPLGEAYATKAR 
AHGLEVEPSALEQGFRQAYRAQSRSFPNYGLSHGLTSRQWWIJ3V 
VLQTFH LAGVQDAOAVAPIAEQLYKDFSHPCrWQVLDG^ 
ECRTRGIjaiAVTSNFX>RRLEGII^CSLGIiRBHF^^ 

tmnDDTD/WST DT KHMDmn nt itmrr«iruivT /mv/vmniiTiy^urTPbr 

KFIJFKlifuKALKJLtfUiM^ 

WGPQAi^PVWDSVPKEHILPSLAHLLPALDCLEGSTPGL 


" 6393 


2017 


730 


TGGSKMAAVATCGSVAASTCSAVATASKSNVTS FQRRGPRASVT 
NDSGPRLVS IAGTRPS VRNGQIiTiVS'IGL PALDQM/5GGLAVGTV 
LL I EEDKYN I YS PLL FKYFLAEG I VNGHTLL VASAKEDPANILQ 
ELPAPLLDDKCKKEFDEDVYNHKTPESNIKMKIAWRYQLLPKME 
IGPVSSSRFGHYYDASKRMPQEIjIEASNWHGFFT»PEKISSTLKV 
EPCSLTPGYTKLLQFIQNI IYEEGFDGSNPQKKQRN1LRIGIQN 
LGSPLWGDDICCaVKNGGNSHSLTKFLYVLRGLLRTSLSACI ITM 
PTHLIQNKAI lARVTTI^DVVVGLESFIGSERETNPLYKDYHGL 
IH IRQI PRLNNL I CDES D VKDLAFICLKRKtiFT I BRLHLP PDLSD 
TVSR33KMDLAKSAKRIiGPGGGMMAGGKKHLDF 




1410 


all 


GAAAGGEGARRRPAAMA I VMAATAAERAVLEEE FRWLLHDEVHA 
VLKQLQDILKFJ^LRF^PLPGSGTEGPAX0^FII»GSC35TDQVKG 
VLTIXX5DAI>SQADVNLKMPR^^U,HFAFREDKQ WKI^ I QOPiR 
bfflVSOAIYLLTSRIXJSYQPKTGAE^^ 

TPATLTIiPB IAASGLTRMFAPAIiPSDLIjVNVYIinjNKIjCLTVYQ 
LHALQPNSl^FRPAGGAVmSPGAMFF>^GRIiEVSHWK^EC 
VIPWLNDALVYFTVSLQIiCQQLKDKISVFSSY^SYRPF 


6395 


13 


658 


PSGRPTRPLCCAARRGAARHGGSVSGWPAGRTPTETSNPGSS VM 
F^VTFFJ>VAVRFIQEWAIJJ)SARRSLCKYRMLDQCRTLASRGTP 
PCKPS CVSQLGQRAEPKATERG ILRATGVAWES QIiKPEBLPSMQ 
DLLEEASSRDMOMGPCTLFIiRMGIjVP^ trrrrtpt.tr FDRPAT jCYR 

PPWSLGCTGLKAAMQIQRWI pvptlghrnpwvardsge 


6396 


1 


1221 


anilsspskrgqbdgtligyspegtplynfmgdafqhssqs ipr"f" 

IKESLKQrLEESDSRQIFYFLCimiFTFVELFYGVLTNSIiGLI 

sdgfhmlfdcsalvmglfaalmsrwkatrifsygygrieilsgf 

INGI^LIVrjVFFVFMBSVARiIDPPEU>™^ 

IGICAFSHAHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSH 

GS AGGGMNANMRG VFIjHVLADTLGS I G V I VSTVL I EQ FGW F IAD 

PLCSLFIAXlilFI^VVPLIKnAOQVI^LRLPPEYEKEtJIIAQEK 

IQKIEGLISYRDPHFWRHSAS I VAGTIHIQVTSDVLEQRIVQQV 

TGILFOWmJNLTIQVEK^YFQHMSGI^TGFH^ 

MKYCKDGTYIM 


6397 


391 


122 


GAGGVGRFEMRAPARMIEWCNDRIjGKKVRVX^ 
KI*IAAQTGTRWNKI VLKKWYT I FKDHVSLGD YBIHDGMNIjBI*YY 
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Amino acid segment containing signal peptide 
(A=> Alanine, C=Cysteine, D^Aspartic Acid, E» 
Glutamic Acid, F^Pnenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, K^Lysine, 
L ^Leucine, H=Methionine , N=Asparagine, 
P=*Proline, Q=Glut amine, R=Arginine, 
S=Serine, "^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *«=Stop 
Codon, /^possible nucleotide deletion, 
\=*possible nucleotide insertion) 








Q 


6399 


353 


1306 


HKQMaPLimoaaLLPTTVPPATMRIWLLGGLLPFIiLLLSGLQ 

RPTEGSEVAIKIDPDFAPGSPDDQYQGCSKQVMEKLTQGDYFTK 

DIKAQKNYFPJWQKAHIJlWIiNQGKVL^ 

SNVHSDFTBAMASVARTPQQYKRSPHPKYIJnm 

SIMENGTLC!nmiYRTKDVHFNAYTGATIRFGQFI>STS 

QBFGNQTLPTlPTCWAPVQypSLKKEVLIPPYELFKVINMS^ 

PRG£WI^LRSTGl«»STYNC$Ii,K^ 

VIIFSKSRV 


6399 


75 


1245 


PNLETYFGRRCEKDSMNPTPTHTPVCRKRTVVS KRGVAVSGPTK 
RRGMADSLESTPLPSPEDRIiAKLHPSKKLliEyYQKKMAECKAEN 
EDLLKKLELYKEACHBGQHKIjECDLQQR^ 

LFQE R EH VLRL Y S iiN DRIiR I RELiE D KKK I QN JjLAL VGT DAGEVT 
YFCKEPPIIKVTILQKTIQAVGECEQSE33AFKADPKISKRRPSR 
ERKBSS EH YQRD IQTLI LQVEALQAOIiGEQTKLS REQ I EGI» I ED 
RR IHIiBE IQVQHQRNQNKI KELTKNLHHTQKLLYES TKDFLQLR 
SENQNKEKSWMLEKDNLMSKI KQYRVQCKKKEDKIGKVLPVMHK 
SHHAQS EYI KVMS LCRNEWYFSGRVEG I PKNLQFVM 


6400 


2520 


1053 


KUWKCDEVWEVQSAII^IWCGYAMKTGKFPHNLMERKDPETWU 
r>NISVTFLSLTDLQKNETIJ>Tir»I SLSGAVQLKHLSNNLETIiKR 
DPXKLLPLBIjSFYI»IdWLnp^ 

QTACaulI^QIDDSVQDAraWQamJCAILRMKQLEDH 

S LIGHS ARVYALYYKDGLli CTGSDDI^ AKIjWD VSTGQCVYG I QT 

HTCAAVKJT)EQKLVTGS FDNTVACW KWS SGARTQHFRGHTGAVF 

SVDYNDELDILVSGSADFTVTCVWALSAGTCLNTLTOT 

VLQKCKVKSIJLHSPGDYTLLSADKYE I KI WP IGRB I NCKCWCTTj 

SVSEDRSICLQPRLHFDGKYIVCSSAIiGLYQV?DFASYDII,RVXK 

TPBIANIJUjLGFGDI FAIJjFDNRYIjYIMDIiRTBSLISRWPIjPEY 

RKSKRGSSFl^KASWIiNGLDGHNDTGLVFAXSKPDHS IHLVLW 

KBHG 


6401 


109 


766 


PGAAWSRPDLRGCCTGPQPALRMLVLPSPCPQPLAFSSVETMEG 
PPRRTCRSPBPGPSSSrGSPO^SSPPRimYI^liyrOGVPYTVI, 
VDEESQREPGASGAPGQKKCYSCPVCSRVFEYMSYLQRHSITHS 
EVKPFECDIOGKAFKRASHIJ&HHSI^ 

RDMiSIAQHSRVIt^BRPFQCPHCPPJRFMF^NTI^KHTRWKHP 


6402 


1196 


279 


TTSQCXKJIRQSSAIPVASMEFAAJCIiRNALIiIiLPKEQQDPKQEN 
GAKNSNQLGGNTBSSESSBTCSSKSHDGDKFIPAPPSSPLRKQE 
LENLKCS I IJVCSAWALALGimiMALNHADKIJ^PKLSGSL^ 
IX3HLYAAEALISLDRISDAIim^PENVTDVSI^ISSNEQIX2GS 
DKGENBAMESSGKRAPC^OTSSVNSARTVMLFOTX^YCT^EY 
DKARKCI^HQAASMIHPKEVPPEAI LIAVYI^LO^IGNTQIAIjQI I 
KRNOJ^AVKTHSBVRKKPA^PVHPIQPIQMPAFTTVQRK 


6403 


2 


1690 


RG IHTSVLCX^LQNQN YSI INWIMNI^ I TNLQ 
RS VDDTSQAIQR IKNDEX?NLQQVFLQAKKDTDWLKEKVQS LQTL 
AANNS AIiAKANNDTLEDMNSQIiWS FTGGjMFJWITTISOftNEQNIiK 
DLQDLHKDAENRTAIKFNQLEERFQLFETDI VNI ISNISYTAHH 
IJlTLTSNIiNEVRTTCrm^^ 

MQQDL^SRLDTEVAKI^ VIMEEMKLVDSKHGQLIKNFT I LQGP 
PG PRGPRGDRGSQGP PGPTGNKGQKGEKGE PGP PGPAGERGPIG 
PAGPPGBRGGKGSKGSOGPKGSRGSPGKPGPQGPSGDPGPPGPP 
GKBGLPGPQGPPGFQGWTVGEPGVPGPRGLPGLPGVPGMPGP 
KGPPGPPGPSGAWPLALONEPTPAPEDNSCPPHWKNFTDKCYY 
FSVE KK I FKDAXLFCEDKS SHLVFINTREEQQW I KKQMVGRESH 
WIGLTDSERENKWKWLDGTSPDYKNWKAGQPDNWGHGHGPGEDC 
AGLI YAGQWNDFQCEDVNNF I CRKDRETVliS SAL 


6404 


1012 


222 


AAALAMAAPAPGLISVFSSSQEIX3AAIiAQLVAQRAACCIjAGARA 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, 0?Cysteine, D«Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glyc±ne, 
H=rHistidine, I-Isoleucine, K^Lysine, 
I»=Leucine, M=Methionine, N=Asparagine, 
P=«Proline, Q^Glutamine, R=d\rginine, 
S^Serine, TVThreonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








RFALGIiSGGSLVSMIARBLPAAVAPAGPASLARWTIiGFCDERLV 
PFDHAESTYGliYRTHLLSRLPIPBSQVITINPELPVEEAABDYA 
KKLRQAFQGDS I PVPDLLlLGVGPDGHTCSLFPDHPIiLQKRKKI 
VAPISDS PKPPPQRVTLTLFVLNAARTVI PVATGEGKAAVLKRI . 
LEIXJBKNPLPAALVQPHTGKLCWFLDEAAARLIiTVPFBKHSPL 


6405 


1 


1456 


AALPRPTPRAPLGREGTGSDSKMAASMFYGRLVAVATLRNHRPR 

TAQRAAAQVIjGSSGLFNNHGLQVCKMQQRNLSI^EyMSMKLLQE 

AGVSVPKGYVAKSPDBAYAIAKKI^KDVVIKAO^J«^RGKGT 

FESGLKGGVKIVPSPEEAKAVSSO^IGIOCLPTKQTGEKGRICNQ 

VLVCERKYPRREYYPA1TMEP>SFQGPVLIGSSHGGVNIEDVAAE 

TPRAI IKEPIDIEEGIKKEQALQLAQKMGFPPNIVESAAENHVK 

LYSLFLKYDATM IB INPMVEDSDGAVLCMDAKINFDSNS AYRQ K 

KI PDIX3DWTQEDERDKDAAKANI^YIGLIX5NIGCI.VNGAGLAMA 

TMDIIKLHGGTPANFI^VGGGATVHQVTBAFKIilTSDKKVIi^ 

VNI FGGI MRCDVTAQG I VMAVKDLEI KI PVVVRLQGTRVDDAKA 

LIADSGLKIIiACDDLDEAARW/VW^ 

I 


6406 


1036 


167 


H PRQMRGEDTPEAP P YS SGR YDS I KTEVSGCPBDLTVGRAPTAD 
DDDDDHDDHBDNDKMNDSEGMDPERIjKAFNMPVRIiFVDENIjDRM 
VPISKQPKEK10AIIRSCSRQFPEPQEPJVRKRIRTYLKSCRRMK 
KMGMEMTRPTPPHIiTSAMAENlIiAAACBSETRKAAKRMRliEIYQ 
SSQDEPIALDKQHSRDSAAITHSTYSLPASSYSQDPVYANGGLN 
YSYRGYGALSSNI^PPASLQTGNHSNGBSGEARAIiASRPAPSWV 
CRAALGSGMGRGKQRPVMBRGCLTA 


6407 


492 


150 


VGLCIAVSO^CVIJVQLDAIivVPGQVAQLSCr]^ 
SVryO^RAGSAPRYLLYYRBEEDHHRPADIPDRFSAAKDEAHNAC 
VLTI SPVQPEDDADYYCSVGYGFS P 


6408 


1458 


903 


RGC ITSSQAWRIiFGGVTRGFNMRI B KCYFCSGPI YPGHGP4MFVR 

NDCKVPTlFCKSKaiKNFKKKRNPRKVRWTKAFRKAAGKELTVDN 

S FB PEKRRNE P I K YQR EJb WNKT I DAMKRVEE I KQKRQAKF I MNR 

IrfOOIKEIiQKVQDIKEVKGNIHL:^^ 

DVDMEDAP 


tl409 


1*0 


446 


NTAIANIJLRCFTCDRLCGG^rrAPAP 

GPACLPTKTFR3 YlaPRCffi^TYS CVHORAHLAKHDEL I SKS FOGS 
HGRAYLFNSV 


6410 


85 


607 


RGGTAGCVACLGCWGQSSSPKAAFPAGSACLPADSCPCLLFQAC 
AISGLFNCITIHPI^IAAGVWM1MNAFTIJJ.CEAPFCCQFIBFA 
NTVAEKVDRLRSWQKAVFYCGMAVVPIVXSLTLTTIJjGNAIAP 
TGVLYGT^AI^KKGDAISYARIC^QRQQADERKLAETLBGEL 


6411 


302 


772 


RLS IMASSLNEDPEGSRITYVKGDLFACPKTDSLAHCISEDCRM 
GAG IAVLFKKKFGGVQBI^QQKKSGF^AVDKRIXJRYI YYTilTK 
KRASHKPTYENLOKSTiEAMKBHCLKNGVTDIjSMP 
WENVS AM I EEVFE ATD 1 K I TVYTL 


6412 


61 


1709 


RPVTS FS PLPGS CGGRLGTRTMLGR S LRE VSAALKQGQ I T PTEL 
CQKCLSLIKKTKFIiNAYITVSEEVALKQAEESEKRYKKGG^IjGD 
I^IPIAVKDNFSTSGIETTCASNMLKGYIPPYNATVVQKIjLDQ 

gaijj^gkinldbfamgsgstdgvfgpvk^ 
phsf^bdsdwlitggssggsaaavsaftcyaaujsdtggstrnp 

AAHCGIiVGFKPS YGIjVSRHGIjIPLVNS WDVPGI I/TRCVDDAAIV 
liGAIJVGPDPl^STTVHBPINKPFMLPS LADVSKLCIGIPKEYLV 
PELSSEVQSLWS KAADLFES EGAKVIEVSLPHTS YS IVC YHVU: 
TSEVASNMARFDGLQ YGHRCD I DVSTEAMYAATRREG FNDWRG 
RILSGNFFIiLKBNYElWFVKAQKVRRLIANDFV^ 
TPTTLSEAVPYLEFIKEDNRTRSAQDDIFTQAVNMAGIaPAVS IP 
VALSNO^LPIGI^FIGPJVFCTOOJ^TVAKWFEKQVOFPVI QL0E 
LMDDC5AVLENEKLASVSLKQ 
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to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AwAlanine, C=Cysteine, D=»Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
Ifc=Hi3tidine, I-Isoleucine, K=*Lysine, 
^Leucine, M=Methionine, N=Asparagine, 
?= Pro line, Q=: Glut amine, R^Arginine , 
S=Serine, T=Threonine, V*-Valine, 
W=Tryptophan, Y-Tyroaine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ ^possible nucleotide insertion) 


6413 


2 


685 


HBPRCAGMAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIR 
NRLTG I PAGYCFVEFADLATABKCLHKIWGKPLPGATPAKRFKL 
IT^ATYGKQPD^S PEYSLFVGDLTPDVDIX3MLYE FFVKVYPS CRG 
GKWLDQTGVS KGYGFVKPTDKLEQKRALTECWAVGLGSKPVR 
LSVAI PKASRVKPVB YSCWS YS YNQYYQQYQNYYAQWGYDQNT 
GSYSYSYPQYGYTQSTMQTYEKVGDDALEDPMPQbDVTKANKEF 
MBQSBBI»YEtAIjMDCHWQPLDTVSSEXPA^ 


6414 


1 


538 


RGGRAALLPW RRFPCCR PRPQPARPSSRATPGPRS PGMATS IGV 
SFSVGIKiVPFJ^KNAGBPKNTYILRPVFQQRFRPSVVKDCIHAV 
LKBBLAHAKYSPEBMPQLTKHLSBN1KDKIJC^!GPDRYKMVVQV 
VIGEQRGEGVFimSRCFWmiyrD>rn:HDVFMNDSLFCVVAAFGC 
FYY 


6415 


2 


1168 


FTOQWQSSHRRACGLGCEARAGGGEEPRGRASSVAGWVGAFRAP 
FIKAAVAGIXSAGSGKRRRGWKMPVHSRGDKJCBI'NHHDEMEVDYA 
ENEGSSSEDFJ>TKSSSV^BI)GDS5?3>roDBDCERP^MECLDBMSN 
I^KQPrDLKIX3LYKERI>SaVDAKLQEVIAGKAPEYLEPLATLQE 
NMQ I RTKVAG I YRRLCIiESVKNKYBCE I QAS RQHCES EKLIXYD 
TVQSELEEKIRRLBEDRHSIDITSEIjWNDBLQSRKKRKDPFWPD 
KiCKPCWSGPYIVYML^LDIIiEDW'rriRKAMATLGPHRVKTKP 
PVKLEKHI»HSARSEEGR1*YYDGBHYIRGQTICIDKKDBCPTSAV 
ITTINHDB VWFKRPDGSKSKLYTSQLQKGKYS IKHS 


6416 


410 


1519 


EIAP ADLE I PAQ\PVLI»SRATSSTMS VTGGKMAPSLTQE t LSHI> 
GLAS KTAAWCrriOTLRTFLNFS VDKDAQRI»LRAITGQGVDRSAI 
VDVLTt^REQRQLISRNFQERTQQDLMKSLQAALSGNLRRIVM 
ALLQPTAQ FBAQBLRTALKASDSAVDVA IB I LATRTPPOLQECL 
AVY KHNFQ VEAYDG ITS ETSGI LQDIiLLiALAKGGRDS YS GI I D Y 
NliAEQDVQALQRAEGPSREETWVPVFTQRNPEHLIRVFDQYQRS 
TGQEIjEEAVQNRFHGDAO VALLGLASVIKNTPItY FADJCLHQALQ 
ETEPN YQVL IR I LISRCETDLLS IRAEFRKKFGKS LYS S LQDAV 
KGDCQSALLALCRAEDW 


6417 


1 


845 


RGESRVLWS ELKGEAGGAGGWAS S LWARMDNRFATAPVIACVLS 

LISTIYMAASIGTDFWYEYRSPVQENSSDIJ^KSIWDEFISDBAD 

EKTYNDALFllYNGTVGLWRR,CITI PKNM^IVi YS P PERTBS FDWT 

KCVSFTt/rBQPMBKFVDPGNHNSGIDIiLRra 

GLMCFGALIGLCACICRSLYPTIATGlI^^ 

AG I EliLHQKLEIiPDNVSGEFGWS FCLACVSAP W)FMASAI*PTWA 

AHTNRKEYTTiMKAYRVA 


6418 


2 


662 


TRTRPRRPPGI^AAVGKAGARSTSTPAGASPAAAYQADPPPPAH 
TPAPPPPPPCGGIACHGEPAKFYGYDNLQRQP I PTTQQBABLVQ 
YPDCKSSSGNIGEDPDHLNQSSSPSQHFPWMRPQAAPGRRRGRQ 
XYSRF0TIiB1jEKEET^FNPYI*TRKRRIBVSHAI^ 
NRRMKWKKENNKDKPPVSRQEVKIKjETKXEAjOELEET)RA 


6419 


1 


973 


PGRPRVRNFDIiNSKS ILQEFFCTRS IQ I PANRSKTAMSKCP I FP 
MARS ISTSGPI^KBDTGRQKLISTGSI^ATLQCATDSLGLEWHL 
PSPDPVTVPYI*S PLVVWXEI*E5IjIiENIsGDHM 
FWNLVWYFRRLDLPSNLPGL I LSSBHCNKYSK I PRHCMSBDS KY 
VLIOMLWDNMKI^QDPGQPLYILWNAHTQKYPMVHIXQKSIMSF 
NOJBIJ»KSMVKBIKMNDVYGPHSQII»ETLNKCPirFKR 
LFLSIiYALGI^IDrnAFDKBYKMAYDRLTPSQVKSTHNCDRPP 
STGVMECRKTFGBPYL 


6420 


207 I 


1187 


RKM I DKNQTCGVGQDS VP YMI CLIH I LEEWFGVEQLEDYLNFAN 
YLLWVFTPLILLILPYFTIFLLYLTI IFLHIYKRKNVLKEAYSH 
l^WIX^KlVATLWIXSnAAVWHGYEVHGMEiaPEDGPALI IFYH 
GAI P I DFYYEMAKI FIHKGRTCRVVADHFVFKI PGFSLLLDVFC 
ALHGPREKCVEILRSGHLLAI S PGGVRKALI SDBTYNI VKGHRR 
GFAQVAIDAKVPI I PMFTQN I REGFRSLGGTRLFRWLYEKFR YP 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AsAlanine, C=Cysteine, D=Aspartic Acid, B=» 
Glutamic Acid, ^Phenylalanine , G=Glycine, 
HtsHistidine, I-Isoleucine, 10=I.ysine, 
Ii=Ijeucine, M=Methionine, N=Asparagine, 
P=Proline, Q-Glut amine, R=Arginine, 

Serine, T=Threonine, Valine, 
W=Tryptophan, Y=»Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FAPMYGGFPVKLRTYLGDPIPYDPQITAEfiliAEKTKNAVQALID 
KHQRIPGNIMSALIiERFH 


6421 


1044 


362 


WALSLRRQPEHMSNKIiLSPIIPHSVVIJISEFKMASSPAVIiRASRL 
YQWSLKSSAQFLGS PQLRQVGQI IRVPARMAATIiILEPAGRCCW 
DEPVRIAVRGIAPEOPVT^RASIiRDEKGALFQAHARYRAimiGE 
LDLERAPAJUGGS FAGLEPMGLLWALEPEKPLVRLVKRDVRTPLA 
VELBVU)GHDPDPGRI»LCXmHBRYF^ 

LFIiPPEPGPFPGIVI^FGTGGGIiIjEYRASLIjAGKGFAVMALAYY 
NYKDLPKTMETI^KYFKEAMNYIXSHPEVKGPGVGLLGISKGG 
FXCLSMAS FLKG I TAAVVXNGS VANVGGTLRYKGETIjP P VGVNR 
NRI KVTKDG YADIVDVLNS PLEGPDQKS FI PVBRAESTFIjFLVG 
QDDHNWKSEPTiIANKACKRIjQAHGRRKPQI ICYPBTGHYIEPPYF 
PLCRAS LHALVG9 P I IWGGBPRAHAMAQVDAWKQIiQTFFHKHLG 
GREGTIPSKV 


6422 


1B1 


2133 


EGENI^SWFQEFWGDIAKEFYWKTPCPGPFLRYNFDVTKGKIFIE 

WMKGATTNI CYimJ^POmiEKia^DKVAFYWPa^EPGETTQITY 

HQLLVQVCQPSNVIjRKQGI HKGDRVAIYMPMI PBLWAMLACAR 

1GALHS I WAGFSSESLCERILDSSCSl»LITTDAFYRGEKIiVKI* 

KEIJ^BALQKC^BKGFPTOCCIVVKHDGRA^ 

KR3CPDVQI SWNQGIDLWWHELMQEAGDECE PEWCD AEDPLFI L 

YTSGSTGXPKGVVHTVGGYMLYVATTFKYVFTJFHABDVFWCTAD 

IGWITCHSYVTYGPLANGATSVIiFEGIPTYPDVNRLWS i vdkyk 

VTKFYTAPTAIRLLJMKFGDE P VTKHSRASLQ VLGTVGEP INPEA 

WLWYHRWGAQRCP IVDTFWQTETGGHMLTPLPGATPMKPGSAT 

FPFFWAPAIIJIESGEELBGEAEGyiiVPKQPWPGIMRTVYGNHE 

RFETTY PKKPPG YYV1GIXJCQRDQDGYYW I TGRI DDMLNVSGHL 

LSTAEVESALVEHEAVAEAAVVGHPHPVKGKCLYCFVTLCDGHT 

FSPKIiTEELKKQIREKIGPIATPDYIQNAPGLPKTRSGKIMRRV 

liRKIAQNDHDLGDMSTVADPSVISHIiFSHRCLTIQ 


6423 


614 


1237 


ANI/KE IPRDLPPKTVIiL YLDSNQITS XPMEI FKDLHQLRVIiNIjS 
KNGI EFIDFJiAFTCGVAETLQTLDLSDNRIQSVHKNAFW 
RlANNPWHOXrrUXJVLRSMASN^TAHNVI CKTSVLDEHAGRP 
FLNAANDAI)LCNLPKK7ITO YAMLVTM FGWFTMVI S YWYYVRQN 
QEDARRHLEY kKSLPSRQKKADEPDD I STW 


6424 


1 


1188 


KKV3WPVAA^fVHCSCVLKRKYGNFIDKLRLFTRGGSGGMGYPRL 
GGEGGKGGPVWVAHNRMTIiKQLKD^^ 

LKGSKGKDWBIPVPVGISVTDENGKI IGELNKENDRI LVAQGGL 
GGKX.1.TNI FI.PLKGQKR I IHLDLKLIADVGLVGFPNAGKSSLLSC 
VSHAXPAXADYAFTTLKPELGKIMYS DFKQIS VADLPG L I EGAH 
MNKGMGHKFLKHIERTRQlJjFVVDISGFQLSSHTQYRTAFEri I 
LLTKBxjELYKEELG^TKPAIilA^ 

KDFliBliFElCNMIPERTVBFQHIIPISAVlt^IEEtjKNCIRKSIi 

htmAMniTMn&T tJW/YT.T.WT tJTCrYPMC Q'l'l»'PPCirTT&^l T T w r'g VMTMTT 
U{SUAv<UJ&PI UfUjIilvIvUUliri liW io LTITHOO 1 a JrlrLJ JUiAV 1 X OWIU X 1 


6425 


1850 


1144 


LAMKGGGGIPLETLKJ3ESQSRHVLPASFEVNSLQKSNWGFLLTG 
LVGGTLVAVYAVATPFVTPALRKVCLP 

CRl?OS I.VDIGSGDGRI V I AAAKKGFTAVGYELNPW LVWYS RYRA 
WRBGVHGSAKFY I SDLWKVTFSQYSNWI FGVPQMMLQLEKiCLE 
RELEDDARVlACRFPPPHWTPDHVTGEGrDTVWAYDASTFRGRE 
KRPCTSMHFQLPIQA 


6426 


30 


565 


SRGAAVGGMSVAGGEIRGDTGGEDTAAPGRFS FS PEPTLED IRR 
LHAE FAAERDWEQFHQPRNLLIJUiVQE VGELAELFQWKTDG E PG 
P^WSPRERAAI^EEI^DVLIYLVALAARC^VTO^ 
NRRRYPAHLARS S S RKYTELPHGAISEBQAVGPAD I PCDSTGQT 
ST 


6427 


145 


959 


AASWGPPHVPKAGKMVSWMICRLVVIiWGMLCPAYASYKAVKTK 
N1REYWHMMYWIWALFMAAJEIVTDIFISWFPFYYEI KMAFYL 



502 



WO 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
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corresponding 
to first 
amino acid 
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amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A»Alanine, (^Cysteine , D=Aspartic Acid, E= 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H=>Hietidine, I=Isoleucine, K-Lysine # 
L=Leucine, M==Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R-Arginine, 
S=Serine, T«=Threonine, V=*Valine, 
W=Tryptophan, Y ^Tyrosine, X-Unknown, *=Stop 
Codon, /=»possible nucleotide deletion, 
Vpossible nucleotide insertion) 








WLLS PYTKGASLLYRKFVHPSLSRHEKBII3A.YI VQAKERS YETV 
LSFGKRGLNIAAS AAVQAATKSQGALAGRLRS FSMQDLRS I SDA 
PAPAYHDPLYLKDQVSHRRPPIGYRAGGLQDSDTEDKCWSDTEA 
VPRAPARPRBKPLIRSQSIiRVVKRKPPVRBGTSRSLKVRTRKKT 
VPSDVDS 


6428 


1982 


444 


SGSGGKMBDKQHVPIDIQTSKLI.DWLVDRRHCSIjKWQSLVLTIR 

EKINAAIQDMPESKEIIAQLLSGSYIK^FHC^^ 

KNI FGR YSSQRMKI5WQE I IALYKKDNTYIiVELSS I^VRNVNYEI 

PSLJQCQIAKOQQLQQBYSRKBSBCQAGAAJBMREQFYHSCKX2Y6I 

TGKNWGELIJUjVKnLPSQIAEIGAAAQQSLGEAIDVYQASVGF 

VCES PTEQVLPMLRFVQKRGNSTVYEWRTGTE PS VVERPHIiBBL 

PEQVAEDA1DWGDFGVEAVSEGTDSGISAEAAGIDWGIFPESDS 

KDPGGIXSIDWGDDAVALQITVLKAGTQAPBGVARGPDALTIiLEY 

TBTRNQ PTjDELMELKI FLAQRAVBLSEEADVLSVSQFQIiAPAI L 

QGQTKEKMVTTWSVIiEDLIGKIiTSLQLQHLFMlLASPRYVDRV^ 

EFLQQKfcKQSQLIjALKKBWmjKQ^ 

TKELQKLIEADISKRYSGRPVNLMGTSL 


6423 


3413 


3442 


EPSS WTAAPRGPLAAHPLEAAVQEDDRRALS FDSRI KVFANGTL 
VVKS VTDKIJAGD YIXVARNKVGDDYVVLKVD EHKBE 
NDHKVFYGGDLKVDCVATGLPNPEI S WS LPDGS LVNS FMQSDDS 

UunlAJlI V v CKix%\J 1 J_>i C [%Ct V LM T lt\JEi.nAjl'I X^TjHJVWy VVjAJj'JjPIK.VK 

VKVVTAPATIRNKTCIAVQVPYGDV^^ 

PTNKVI PTSSEKYOI YQIX5TLbIQKAQRSDSGNYTCIiVRNSAGE ' 
DRKTVWIHVNVQPPKINGNPWPITTVREIAAGGSRKLIDCKAEG 
I PT PR VLWAFP EGWLPAP YYGNR I TVHGNGSLDI RSLRXSDS V 
QLVCMARNEGGRARLl VQLTVI^PMEKPIFHDPI tJBKITAMAGH 
TISLNCSAAGTPTPSLVWVLPNGTDLQSGQQLQR FYHKADGMLH 
I SGLS S VD AGA YR CVARNAAGHTERIiVSLKVGLKPEANKQYHNIi 
VS 1 1 NGETLKLP CTPPGAGOGRFSWTIjPNGMKXEG P QTLGR VSL 
LDNGTLTVREASVFXJRGTYVCRMETEYGPSVTS I PVXVIAYPPR 
rTSEPTPVIYTRPGNTWLNCMAMGIPKADITWEI^DKSHI.F^ 

VQARLYGNRFIjHPQGS lti qhatqrdagfykcmakni lgsds kt 

TYIHVF 


6430 


1946 


602 


RTRVSTGIiRRTLIjWSEAVGASSTRGDTGI pgsgbggagpgggeg 
AMLEAMAEPSPBDPPPTLKPETQPPEKRRRriEI)FNKFCSFVLA 
YAGYI PPSKEESDWPASGSSSPLRGESAADSDGWDSAPSDLRTI 
QTFVKKAXSSKRRAAC^PTOPGPPR^TFSRI^PDSATLLEKM 
KLKDSI^IJX3PKVASPMP^SLTHTSRPPAALTPVPLSQGDI»S 








KRKLKKAFJiGrffiLPPPGPPOAPPSDTPSEEEREEEEEEREKFJ^ 
IWGGEAP VP VLPTP PEAPRPPATVHPEGVP PADSES KEVGS TE 
TSQDGDAS SSEGEMRVMDED I MVESGDDS WDI>ITCY CRKPFAGR 
PMIBCSL<XHWIHI^CAKIKKTNVPDFF^ 
GPPKSGEP 


6431 


3 


605 


WWNSSYNLPAYAPYliPCBACAMQDGRKGGAYAGKMEATTAGVGR 
LKEFJUjRRORLKAbREK'IGRKPKEDGEPKTKHLREBEEEGEKH 
EtELRLRNYVPEDEDLKKRRVpQAKPVAVEEKVKEQLEAAKPBPV 
I F^VDLANIAPRKPDHDUOIDVAKKLEKLKKRTQPJ^ 
LKGQEDSLASAVDAATEQKTCDSD 


6432 


56 


1692 


GGLGTMGSRIKQNPETTFEVYVEVAYPRTGGT1>SDPEVQRQFPE 
DYSWEVWTLTKFCFPFYVDSLTVSQVGQHFTFVLTDIVSKQR 
FGFCRLSSGAKSCFCILSYLPWFEVFYKLLNIIJU3YTTKRQENQ 
WNELI^TLHKLPIPPPGVSVHLSVIISYFTVPDTRELPSI PEKRN 
LTEYFVAVDVNNMLHLYASMLYERRI 1,1 1 CS KI»STLTAC IHGSA 
AMLYPMYWQHV¥IPVI,PPHIJ*DYCCAPMPYT.IGIH^ 
MALDDWI LNVDTNTLETPFDDLQS LPND V I S S LKNRLK KV3TT 
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location 
corresponding 
to first 
amino acid 
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amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, O Cysteine, D=Aspartic Acid, B« 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L^beucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, "^Threonine, V«Valine, 
W-Tryptophan, Y=Tyrosine, X«Unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
\=pos6ible nucleotide insertion) 








TGDGVARAFUCAQAAFFGSiimAI»KIEFKBPITrXSBAPV3HYR 
SGAMRQFIKJNATQLQLFKQFIDGRLDIiLNSGEGFSDVFEEEINM 
GBYAGSDKLYHOm^TVmSSGAIIiNTVKTKANPAMK'i'V YKFDI 
ABNGCAPTPEEQLPKTAPS PLVEAKDPKLRBDRRPITVHFGQVR 
PPRPHWKRPKSMIAVEGRRTSVPSPEQNTIATPATLHILQKSI 
TKFAAKFPTRGWTSSSH 


6433 


1524 


484 


APVTICRKEVFAKDSKGSALDAGRDPKllPALPErLCESGWASNTA 
P/ITPPQPGWCLOC5KDFKSSCQTPGRBKBIuu^ATMHC^CS FLMLL 
IjPLLLIjLVATIGPVtSM^TDBEK^ 

HKRVTOEELAAFAKAYARO^^GiraKKRGRI^KNI^AlTDEGMDV 

PIAMEEWHHERBHYttl^AATC^PGQMOOT^ 

SHFCEKLQGVEETNIELLVCNYEPPGKVKaKRPYQEGTPCSQCP 

SGYHCKNSLCEPIGSPBDAQDIiPYLVTEAPSFRATEASDSRKMG 

AEGPDKPSWSGLNSGPGHVWGPLLGIiliLLpPLVIiAGIF 


6434 


40 


2002 


MPQLNFtJMADPTQMGGLSMl.I.IAGEHALGTPBVFSGTCRPDVSE 

SPKI^QKSPLFQFAEISSSTSHSDASTKQCXJTSAIiFQFAEISSN 

TSQLGGAEPVKRCGKSALKQLiABMClJVSBGMKM^ 

DGGRIKELEKGKEBKEIK^KTDETRIXiKEA^ 

KELRNFEALQ IDD IMAI KMEDPKE IRXEEI£EDHKCSHFPDFS Y 

SASSKI I ISDVPSRKDHMCHPHGIMI IEDPAALNKPEKT>KKKKK 

K3KMDRHGNDKSTPKBCTCKBCRQSSESD I BSVT YTIBAVAKGDWG 

IEKLGDTPRKKVRTS3SGKGS I LDAKPPECKKVKSREKKMSKEKS 

SDTTKESRPPDFI S I SAS KN I SGETPEGI KAEPLTPMEDALPPS 

LSGQAKPEDSDCHRKtETCGSRKSERSCKGALYKTLVSEGMLTS 

LRANVDRGKRSSGKGNSSDHEGCWNBESWTFSQSGTSGSKKFKK 

TKPKRDCIiLGSAKL»DEEFEKKFNSLPQYS PVTFDRKCVPVPRKK 

KKTGNVSSEPTKTSKGSGDKWSNWQLFLDAIHPTKAIFSEDRNT 

MEPVHK\nWIPSIFNT^EPTTTARTFGGQPKEKSKENPDYSPCQ 

DTQRAGYH1 I BEVLW MTNr^MNNCGGVYLKQ LR HTAMTNA 


6435 


2227 


657 


ALQRIlAAAAYAHPEYEERFIiQEKWSQQINSIKI^^ 

VVKSQRPLQRQVHI»RGRPASQPTVIRGITYYKAKVSBEENDIEE 

QQDEFFSGDNGVDI»IjIEDQLLIOfNGLMTSVTRRPA^ 

VTSDmARTAPWSSAl,PQPSTSDPSIANHASVGPTLQTTSVSPD 

PTRKSVJ^PSPQVPATTVAHTATQQPi^APPAVSPRBAtMEAM 

HTVP VPPTTVRTDSLGKDAPAGRGTTPAS PTLSFBEBDDXRNVX 

GRCKDTLST I TGPTTQKnTYGRNF^AWMKDPIAKDERIYVTNYYY 

GNTLVBFRNLENFKG^RWSNSYKLPYSWIGTGHVV^ 

AFTRNI 1 KYDIiKQRYVAAWAMUIDVAYEEATPWRNQGHSDVDFA 

VDKlWLWLIYPALDDEGFSQEVTVIiSKIJ^ 

LRRNFYGMCFVICGVLYAVDS YNQRNANI SYAFDTHTNTQIVPR 

LLFENB YFYTTQI DYNPKDRLL. YAWDNGHQVTYHVI FAY 


6436 


1295 


341 


GACRPPVRQD PDSG PDYEAJ^AGATVTTHMVAG AVAG I LEHC VM 
YPIDCVKTRMQSLQPDPAARYP^m>EALWRI IRTEGLWRPMRGL 
NVTATGAGPAHALYFACYEKLitKTLS DVI K PGGNSH I ANGAAGC 
VATLLHDAAMNPAE WKQRMQMYNS P YIIRVTDCVRAVWQlfEGAG 
AFYRS YTTQIiTMNVPFOAIHFMTYE FKJBHFNPQRRYNPSS HVL 
SGACAGAVAAAATTPI^VCKTI^TQESIJ^3HITGHITG>^ 
AFRTVYQVGGVTAY FRGVQARVI YQI PSTAIAWS VYBFFKYLIT 
KRQEEWRAGK 


6437 


182B 


360 . 


PPAPAPPASPARHVTRTARGHIiBGGSRAPPLLQAVFLQIKNMVK 

LIHTLADHGDDVNC<!AFSFSrJATC3LDKTIFdiYSLRDFTELPH 

S P LKFHTYAVHCCCFS PSGHILAS CS TDGTTVLWNTENGQMZAV 

ME0PSGSI^VO}FSPDSTCJASGAAlX5TVVI>^QSYKLYRCG 

SVKDGSrAAC^SPNGSFF\TTGSSCGDLTVVroDKMRC3aE 

DIX^ITCCDFSSQPVrSIKJEO^L^FFRLASCGQDCQVKIWIVSFTH 

I LGFT^KYKSTLSGHCAFVIiACAFSHD 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A*= Alanine, C=Cysteine, D^Aspartic Acid, B- 
Glutamic Acid, F^Phenyl alanine, 6=Glycine, 
H-Histidine, I=Isoleucine, K=Lysine, 
L=Leucine , M=Methionine, tt=Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyxosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








TOTKNIIiHTLTQHTRYVTTCAFAPNTLLLATGSMDKTVNIWQFD 
LBTLCQARSTEHQLKQFTEDWSEBDVSTWLCAQDLKDLVGI FKM 
NNIDGKELIJ^TK^LADDIJaESI^LI^KVI^KIEELRTK^ 
LSSGI PDE PI CPlTREIJWDPVIASDGYSy^KEAMBNWDPAKRN 
RTSPP 


6438 


109 


901 


EVQILRAKMPQTGGI,IVFyGLIAQTMAQFGGI^PLDQT^ 
NPALPLSPTGIiAGSLTNAI^NGIiLSGGIiIX5ILENIiPLLO 1 LKPG 
GGTSGGIiLGGLLGKVTS VI PGLNNI IDIKVTDPQIiLELGLVQSP 
DGHRI»YVTI PLGI KLQVNTPLVGAS LLRLAVKLD ITAE I LAVRD 
KQEIiIHliVI/3DCTHSPGSLQISLLIX?I/?PLPIO^IJJ>SLTGILN 
KVLPELVQGNVCPLVNEVI^GI^ITLVHDIV^ 


6439 


23 


412 


SIQTASAITTEMASQSQGIQQ1JXJAEKRAAEKVADARKRKARRL 
KQAKEKAQMKVEQYRREREHEFQS KQOAAMGSQGNIjSAEVEqAT 
RRQVQG>K?SSQQRNRERVIiAQUiGMVCDVRPaV^^ 


6440 


3 


517 


RARVfNSDMGDLPGLVRJ>S IALRIQPNDGPVPVKVDGQRFGQNRT 
IKLLTGSSYKVEVKIKPSTI^VENJSIGGVLVPLELKSKEPD^ 
RVVYTGTYDTEGV , rPTKSGKRQPIQITMPFTDIGTFETVWQVKP 
YNYHKRDHCQWGSPPSVIEYECKPNBTRSLMWVNKBSFIi 


6441 


234 


1373 


KSGGLRRRQRPGRSAAVGE E ELP PGM KKFKAAMLJjGSVGDAIjGY 
PJfVCKBNSTVGMKIQEELQRSGGLDHLVIiSPGEWPVSDNT I MHI 
ATABAIiTTDYWCIJJDLYREMVRCr VE I VEKLPKRRPDPATI EGC 
AQLKPNKYI.I.AWH'rPFNEKGSGFGAATKAMCIGLRYWKPERLET 
LIEVSVEajRMTHNHPTGPI^SI^AI^VSFAAQGKPLVQWGRD 
MLRAVPIJu^EYCTKTII^AEYQEBMFYPEAKWQFYLEERKISK 
DSEN KAI FPDN YDAEBREKT YRKWSSEGRGGRRGHDAPM I AYDA 
IZiAAGNSWTBIiCHRAMFHGGESAATGTIAGCIjPGIiLYGLDLVPK 
GliYODLEDKEKLEDLGAALYRIiSTEEK 


6442 


34 


796 


AEDPAGGLAGQDTMFARGLKRKJWGHREDVEGALAGIjKTVSSYS 

LQRQS LLDMS LVKLQLCKM L V E PNLCRS VLI ANTVRQ I QEEMTQ 

DGTWRTVAPCAAFJiAPIiDRLVSTEILaiAAWGOEGAHPASGlXJD 

GHTC^PVSDU!PVTSAQAPP^a^SAWEMDGPRENRGSFHKSLD 

QIFETLETKlfPSCMEJSLFSDVDSPYYDIiD^ 

EGLEGLAPATPGPSSSCKSDLGELDIIVVEILVET 


6443 


2 


555 


MASPAAS SVRPPRPKKEPQTLVI PKNAAEBQKIiKIiERLMKNPDK 
AVPIPEKPISEWAPRPPPEFVRDVMGSSAGAGSGEFHVYRHIiRRR 
KYQRQDYMDAMAKKQKIillAEFQKRriEKtJKIAAHKG/TAKRRXKRQ 
KLKBKKIiLAICKMKLEQKKQEGPGQPKEGGSSSSAEASGTEBEEB 
VPSFTMGR 


6444 


390 


899 


GSTPRGKMRAP I PEP KPGDL I E I FRPFYRHWAI YVGDGYWHLA 

PPSEVAGAGAASVMSALTDKAIVTCICBIX^ 

DKYSPLPCSKI IQRAEELVGQBVL YKLTS EHCEHFVNE LR YGVA 

RSDQVRDVI IAAS VAGMGLAAMSLIGVMFSRNKRQKQ 


6445 


2 


753 


AGAAGAAGAARS PRP QAHTKGVRGLP S RRRS PDCGRMELAAGS F 
SEHQI'T4FJVCAEL(X}PAI>AGADWQriI»VETSGISrYRLLDKKTGLY 
EYKVFX5VLEDCSPTIiLAI>;rYMD^ 

tvvywbvkyt fpmsnrd yvyiirqrpjjldmegpjcihv^ larstsm 
pqlgersgvirvkqykc^i^iesdgkkgskvfmyyfdnpggqi p 

SWLINWAAiTOVPNFLKDMARACXJNYLKKT 


6446 


1 


1651 


RCPTRS P P PDT PGSRGTTAMCSIiASGATGGRGAVENEED LP ELS 
DSGDEAAWEDEDDADLPHGXQQTPCLFa^RLFTSAEBTFSHCKS 
BI£QF^aDSMVHKHGLEFTGYIKLINFIRLKNPTVEYMNSIYNPV 
PWEKEEYLKPVLEDDLLLQFDVEDLYEPVSVPFSYPNGLSENTS 
WEKLKHMEARAIiSftEAALARARKD^ 

cssstsviadlqededgvyfssygktoie^e^kdkjrtesyrd 
fi yqnph i fkdkwldvgcgtg i i^mfaakagaxkvlgvdqse i 
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ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to fixrst 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
se<juence 


Amino acid segment containing signal peptide 
(A- Alanine, C=Cy6teine, D=Aspartic Acid, E=» 
Glutamic Acid, F=Phenylalanine, G=Glycine , 
H=Histidine, I=Isoleucine, K=*Lysine, 
h -Leu c i ne , M= Methionine , N^Asparagine , 
P-Proline r Q^Glut amine, lUArginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, YeTyrosine, X=u*nknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








bYQAMDI IRLNKLKDTITIiI KGKIEBVHLPVEKVDVI ISEWMGY 

FLLFEShu^SVLYAJKNKYIiAKGGSVYPDICTISLVAVSDVNKHA 

DR I AFWDDVYGPKMS CMKKAVI PEAWEVLD PKTL 1 5EPCG I KH 

IDCHTTSISDI^FSSDPTLKITRTSMCTAIAGYFDIYFEKNCHN 

RVVFSTGPQSTKTHWKOrVFTiLB^ 

KDPRS LTVTLTLNNS TQTYGLQ 


6447 


1554 


1068 


RI/5PAEflHLSGPCHATIJ3&ANRGRAI£;VRAAWRG 
SRTNIATGIPSSKVKYSRI^STDDGYIDI^PKKTPPKIPYKAIA 
LATVLFLIGAPLI IIGSIil^GYISKGGADRAVPVLIIGILVFL 
PGFYHIiRIAYYASKGYRGYSYDDI PDFDD | 


6448 


74 


559 


GQVLSHC YH YRSSRWRRGGLSRGRGAGVMALVP YE ETTE FGLQK 
FHKPLATFS FANHT I Q I RQDWRHLG VAAWWDAA I VLS TYLEMG 
AVELRGRSAVELGAGTGL VG IVAALLACR I R YERDNN FLAMLER 
QPIVRKVHYDPBKDVHIYEAQKRNQKEDL 


6449 


597 


1876 


e ygvcknlrkx»k itgv s crd vyakllhr yrh i lglwqpd igpyu 

giilkwvdglfi igwmylpphdphvddpmrfkplfri hlmerka 

atvecmyghkgphhgriqivkkdkfstkcnqtdhhrmsggrqkk 

frtklreewgrtiiedi fhehmqelilmkfiytsqydncltyrri 

ylppsrpddlikpglfkgtygshgleivwdsfhgrrargrkltg 

dpnipagqqtveidlrhrzqlpdiiengi^ 

rqbqqegghkagbgrgrcx3prbsqp5paqprabapskgpdgtpg 

kdggepgdavaaaeqpaqcgqgqpfvlpvgvssrnedyprtc31m 

cfygtgl i aghgfts pertpcvf ilfdrdrfgfvwlelks fsdy 

srvqatfrnadapspqafdemlkniqslts 


6450 


848 


269 


FVPAPRTVSGKRSLPGEWEERGBGEQRTGRE FS GNGGRAVEAAR 
MRLLCG LWLWbS LLKVJ.QAQTPTPLPLPPPMQS PQGNQFQG EWF 
VLGIAGNSFRP EHRALLNAFTATFE LSDDGRFK VWNAM'l'RGQHC 
DTWS YVLIPAAQPGQFTVDHRVWTHEQAGR PQDQPAGQELVAAS 
RDAGPVHLPGQSSGPLG 


6451 


232 


939 


HSPTPPTSPRASTMEDVKLEPPSLPQCKBJJABEWTYPMRREMQE 
ILPGLFLGP YS SAMKS KL P VLQKHG ITHI IC I RQNTBANF I KPN 
FQQLFRYLVU3 IADNPVENI IRFFPMTKEFTDGSLO^KSGKVLVH 
GNAG I SR SAAFVTAYIMETFGMKYJU>AFAYVQERRFCINPNAGF 
VHQLQKYBAI YIiAKIjTIQMMS PLQIERSLSVHSGTTGSLKRTHE 
BBDDFGTMQVATAQKG 


64S2 


1 


652 


RTRGESSl#IEPI^YPI>KCSGPRAKVFAVI^IVLCrVTLFLI^ 
LKPIJCPKINSFYAFEVKDAJCGRTVSI^I^ 

LTDRNYLGLK^^KEPGPSHFS\n^PCNQFGBSEPRPSKJSVES 

FARKlffGVTFPIFirKIKJliGSEGBPAFRFIiVDSSKKEPRWNF^ 

YLVNPKGQWKFWRPF^PIBVIRPDIAALVRQVIIKKKEI) 


6453 


827 


223 


HRRWLPGI^MSPRRlT^PRPI^La^LCI^LCI^^ 
RDKKNCKVWSCKJELRKRLTPr^YHVTQEKGTESAFEGEYTHHK 
DPGrnCCnA/CGTPLFKSETKFDSGSGWPSFHDVINSEAITFTDD 
FS YGMHRVETSCSQ CG AHLGH I FDDGPR PTGKRYC INSAALS FT 
PADSSGTAEGGSGVASPAQADKABL 


6454 


827 


223 


HRRWLPGLSMSPRRTLPRPI^SIX3>SLCLCLCLAAAIX3SAG^ 
RDKKNCKWFSO^EIJtKRIjTPLQYHVTQER 
DPGIYKCVVC3GTPLFKSETKFDSGSGWPS FHDVINSEAITFTDD 
FSYGMHRVETS CSQ^TGAHLGH I FDDGPR PTGKRYCINS AALS FT 
PADS SGTAEGGSGVAS PAQADKAEL 


6455 


1042 


173 


R VHLATVSAS AAWDALGLP VFvSHMQGS TRRMG VOT FLOJL 

LMrHGVLEEWDVKRLOTOCYKVHDRNATTO^ 

S L Y I BI KRGVTBDDGRPI YALVlilLATTS ISKMATDFAENEIiDLF 

RKAbELI IDSETGEASSTNILNLVDG^KGKKMRKKEAEQVLQKP 

VQNKWLIEKEGEFTLHGRAILEMEQYIRETYPDAVK1CNICHSL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


{A= Alanine, C-Cysteine, D^Aspartic Acid, H= 
Glutamic Acid, F=Pbenylalanine , G=*51ycine, 
H=Histidine, l=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 

S=Serine, T=Threonine, V«Valine, 
W-Tryptophan, Y«*Tyro3ine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
Vpossible nucleotide insertion) 








XjIQGQSCBTCGIRMKLPCVAKYFQSNAEPRCPHCNDYWPHEIPK 
VFDPEKERESGVLKSNKKSLRSRQH 


6456 


2 


555 


R P QS RS I SMWRNS JULQ VS Sti L>KVl UKVLLftM V L> i. lA^KKHJj V I 

TVBAEAALQNKVVAIiYFAAARCAPSRDFTPIiLC^ 

RPAPFEWFVSAKJSSQEMLDFMRELHGAWL^ 

KRYNVTAI PKLVIVKQNGE^TNKGRKQIRBRGIiACTODWVEAA 

DIFQNFSV 


6457 


23 


892 


PTTGFPVTNFPWNWPDGKPPIMIIiYVSKLNKI IHFPDFDKKIPV 

KLFPLPLI*YVGNHISGI>SSTSKLSLPMFTVLRKFTlPLTIjbtjKT 

III/JKQYSIjNIILSVFAIILGAFIAAGSDLAFNIjBGYIFVFI^D 

IFTAANGVYTKQKMDPKBLGKYGVLFYNACFMI IPTLI ISVSTG 

DLQQATEFNQWKNWFIbQFLLSCFLGFIJi^ 

TTAWGA1KNVSVAY IGILIGGDY I FSLLNFVGI^ICMAGGLRY 

SFLTLSSQLKPKPVGEBNICLDLKS 


6458 


23 


892 


PTTGFP VTNFP WNWPDGKPP I M I LY VS KLNK 1 1 HFPDFDKKI PV 
KLFPI^LLYVGNHISGI^STSKI^LPMFTVLPJOT 
IILGKQYSLNIILSVFAI I LGAFIAAGSDLAFNLEGYI FVFLND 
IFTAANGVYTKQW^PKBLGKYGVIiFYNACFMIIPTLIISVSTG 
DLQOATBFKQWKNVVFILQFTjLSCF^ 

TTAVVGAIKNVSVAYIGILIGGDYIFSLTiNFVGI^ICMAGGLRY 
SFLTLSSQLKPKPVGBBNI CXJMjKS 


6459 


23 


892 


pttgfpvtnfpWmwpdgkppimilyvsklnkiihPpdfdkkipv 
klfplpllyvgnhisgi^stsklslpmftvt.rkftipbtiililbt 

III/jKQYSI^IILSVFAIILX3AFIAAGSDLAFNIjEGYIFVFLND 
I FTAAHGVYTKQK^PKE I/5KYGVLFYNACFM 1 1 PTL I ISVSTG 
DIiQQATEENQWKNVVFIIjQFLLSCFIiGF^^ 
TTAWGAIKNVSVAYIGILIGGDYIFSLLNFVGIiNICMAGGLRY 
SFLTLSSQLKPKPVGEBNICLDLKS 


6460 


23 


892 


PTTGFPVTNFPWNWPDGKP PIMILYVSKLNKI IHFPDFDKKI PV 
KL FPLPIXYVGNHI SGLS STS KIiSL PMFTVLRKFT I PLTLLLET 
1 1 LGBQYSLNI ILSVFAI I LGAFIAAGSDLAFNLEGYIFVFLND 
I FTAANGVYTKQKMDPKELGKYQVIiFYllACFMI I PTLI ISVSTG 
DLQQATBFNQWKNWFI LQ FLLSCFLGFLLMYS TVLCS Y YNS AL 
TTAVVGAIKNVSVAYTGILIGGDYIFSLLNFV^ 
SFLTLSSQLKPKPVGBBNICLDLKS 


6461 


1653 


360 


I^IFIQLKRHPERKYBLKEPTEEQQRIJ^ENGEINHODSMSr^; 
LISTSALENBHTFHSQIJjGASLTIJJjY^ 

DLVFS FVFGATSLSFSAPFVVHHCVNPJSDVRIiAW I MTCCPGRS S 

vm rra rinmD r>KTQ vt/l'PMni? & D Iff Ot3 Q Q Jk 1?*! nfTWK ^ JV^ QT71CW*I<ir> 
X Sj V\jyH VUr jrWcwU A-«V JSAJtr AA^cr JNo£JxU»£>£H-» IWAoWaor IvL«OOvc 

GCHCLTNLQAAAAQCHANSL PLNSTPQLDNSI jTEHSMDND I KMHV 
APLEVQFRTNVHS S RHHKNRS KGHRASRLTVLREYAYDVPTS VE 
ncvnwrrT.Pir <tot rmnrghsrsrraylAYREROYNPPOODSSDAC 
STLPKSSRNFEKPVSTTSKXDAIJtKPAVV^ 
QNGPI KSNGQEGPLIX^DSTGNVRTGLWKHETTV 


6462 


3 


773 


SEBLDREKKLKEDS prktpnkesgvpslpvslts ikeefkkakh 

PDSQSMERS KLKNDDRKTPVNMKDSRGTRVAVS SPMSQHQSY IQ 
YLHAYPYPQMYDPSHPAYRAVS pvlmhs ypgayls PGFHYPVYG 
kmsgrebtekvntspsvntkttteskai»dl^ 

PVEKATAEREREAERERDRHS PFGOJU^IiHTHHHTHVGMGYPL I P 
GQYDPFQGLOSAALVASQQVAAQASASGMFPGQRRE 


6463 


2 


350 


vilcilggwifknadrsMekkkgeprtraearpwvdedlkdssd 
lillqaebdadewqeseenvehipfshnhypekemvtcrsqefyrll 
nkrrsvrfisneqvpmkvidnvirtagl 


6464 


12 


1154 


GILRQKEREBRNRIHKKEIIJ'bEHLLWPSL^SLSGKVQTVLG 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re spond i ng 

to first 

dnii.nu ociu 

residue of 
amino acid 
sequence 


Predicted end 

nucleotide 

location 

cor respond ing 

to first 

amino acid 

IcblQUc 

amino acid 
sequence 


Amino acid segment containing signal peptide 
[A- Alanine , C=Cysteine, D«Aspartic Acid, E= j 
Glutamic Acid, F= Phenylalanine , G=Glycine, 
H~Histidine, I=Isoleucine, K- Lysine, 
Ii=Leucine, M=Methionine, N=Asparagine , 
P^Proline, Q=Glut amine , R»Arginine , 
S=Serine, T=»Threonine , V==Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, +=Stop 
Codon, /^possible nucleotide deletion, 
\*possible nucleotide insertion) 








LVEPSKLGKTLTHBHIiAMTFDCCyCPPPPCQBAISKBPIVMKNL 

YW iqknayshkenlqlnqeteai keelly fkangggalventtt 

G ISROTQTLKRLAHKTGVHI I SGAGFYVDATHS SETRAMS VEQL 
TDVLMNE ILHGADGTS IKCGI IGEIGCSWPLTKSKRKVLQATAH 
AQAQLGCPVI IHPGRS SRAPFQI IRILQBAGADISKTVMSHIJDR 
TI LDKKELLEFAQLGCYIiBYDLyGTELIfiYQLGPDIDMPDDNKR 

P KMLLRG I TBNVLDK I L I ENPKQWLTFK 


6465 


126 i 


1396 


KflflVFFiCrLRNHWKKTTAGl^ 
QKAQVFGHQIjI PPKAQVKKA1 w c 

I.HI.SGMDVT I VKTDYEGGJUOCIjLEUIEBITDVI rVAGGDGTLQEV 
VTGVLRRTDEATFSKI PIGFIPLGETSSLSHTLFAESGNKVQHI 
TDATlAIVKGETVPIJWI^rKGEKE^ 

KVSKYWYLEPliKI KAAHFF^TLKEWPQTHQAS I S YTGP TERP PN 
EPEBTPVQRPSLYERn»RRI^YWAQF^ 

!^TIBI*SiriTU^IjPPTSKmDl^ICIEPDTISKGDFITIGSR 
KVRNPKXHVEGTECIiQASQCTLLI PEGAGGSFS I DSEE YEAMP V 
EVKbLPRKLQFFCDPRKREQMLTS PTQ 


6466 


1134 


828 


VARGTEI*SQI#BKAH^PAI)MGRRKSKRKPPPKKKMTO 

PFCHHEKSO>VKlTORAROTWlSC^^ 

YSDWIDACEAANQ 


6467 


301 


2571 


GB1iRVIjAIiAHGBLACHAVLTA£I»^ 
I KCVYVGIJI^VGKTRIjI CARACl^ 

YRVCQF^ERSRDVVDDVSVSLRLWDTFGDHHKDRRFAYGRSDV 
\An^FSIAKPHSIJIHVKTM^PEIKKOFCPRAPVILVGCQl J DLRY 
ADLEAVNRARRPLARP I KPNE ILPPE KGREVAKELG I P YYETSV 
VAQFG I KDVFDNAI RAALI SRRI ILQ FWKSHLRITVQRPIjLQAP FD 
PPKPPPPIIVVPDPPSSSBECPAHLLFJ^PIiCADVTIiVLiQKRVRI 
FAKlCIYI^TSSSKFYDLFl^IiSBGEl^PSBPGGTHPEPHQGH 
SDQHHHHHHHHHGRDFIiliRAASFD^^ 

SDGILRGNGTGYIjPGEGRVLSSWSRAFVSIQEEMAFJ3P1jTYKSR 
LHVWKMOSS IQPGPFRAVLKYTjYTGELDRNKRJDLMHIAHIAEIi 
LBVFDLRMMVAWI LNNFAFMNQBITKAFHVRRTI^VKEOCiAKGT 
FSDVTFlIaDIX^ISAHKl^ISSaJimAAMFGGPFVESSTO 
FPYTSKSCWRAVLEYIiYTXJMFTSSPDL^^ 
VALTEQYTVTGLMBATGWMVDirX3DVI»VF^ 
CI^IHICTNYNIWCRKFPRDMKAMSPENQBYFBKHRKPPVW 
BDHYQRARKBREKEDYIiHLKR^P^ 


6468 


3 


1374 


DAWAGTNMAAIiAPVGSPASRGpRLAAGl^^ 

LGRVHKIJUjKDD vrhkvhln tfgffkdgymvvnvss i>slne pbd 
i03vtigfs1^rtknix;fssyldbdvot^ 
srsbvrvksppeagtqiipki ipsrdfjcvijgqsqepnvnpasagn 

OTQKTQDGGKSKRSTVl^KAMGBiGSFSV^ 

TDIXJEGLYSLYFHKJCXGOTjPSDKFTFSIiDIBITEKNPDSYLSA 

GBIPfcPlOjYXSMAFPFFLSGTIWIHIljR^ 

PFTKSL^SbVFHAIDYHYISSOCFPIl^WAVVYYITHljLKGAT.LF 

ITIALIGTGWAFIKHIl^DKDKiQFMIVIPRRVLANVAYIIIES 

T12F^OTEYGLI^SIjFLVDIJjCCGAILFP 

GKGKFSRAHFVLLSLL 


6469 


3 


1374 


D AWAGTNMAALAP VGS PASRG PRLAAG1VRLLP WjGLI£2LIaAE PG 

LGRVHBlJUjKDDVRHKVHLin'FGFFKl^ 

HOTIGFSIjDRTKI^FSSYIiDBDVHYCS 

SRSEVRVKS PPBAGTQLPKI IFSRDEKVLGQSQEPNVNPASAGN 

Q^KTQDGGKSKRSTVI^KAMGEKSFSVHSWGGAV 

TDDQEGLYSLYFHKCLGKELPSDKFTFSLDIEITEKNPDSYLSA 
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Amino acid segment containing signal peptide 
(A=Alanine, C^Cyreteine, fr=Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K-Lysine , 
L=Leucine, M=Methionine, N^Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=»possible nucleotide insertion) 








GEI PIiPKI>YI SMAFFFFLSGTI WIHILRXRRJTOVFXIHWLMAAL 
PFTK3I>SIiVFHAlDYHYISSQGFPrEGWAVVYYlTHLLKGALLF 
ITIALlGTGWAPlKHtl^DKDKKIFMIVIPHRVLANVAYIIIBS 
TKEGTTBYGLWKDSIiFLVDLLCCGAlLFPVVWS IRHLQEASATD 
GKGKPSRAHFVUjSIiL 


6470 


2726 


1437 


AAASGVSSPJU)APVLAQSPASAGNGRPSTPRVPGSRRHPSAPRS 
GPLPREDGCRTPGPQIiLPLPGALIjRPRTIiI»SSAAETGR5RHPI)T 
QHP3SGGRCRGGTBS PSSAAGRPASMAEAEEDQISDTVRADDDE 
ENES PABTDLQAQIiQMFRAQWMFELAPG VSS SNLBNR PCRAARG 
SLQKXSADTKGKQBQAKEEKAKELFLKAVEEBQNGALYEAIKFY 
RRAMQLV PD I E FKI TYTRS PDGDGVGNS Y I EDKDDDS KMADLLS 
YFQQQLTFQE5VLKLCQPBI£SSQIHISVLPMEVLMYIFRWVVS 
SDIJ)lJtSLEQI^LVCRGraamDPEIWIUiACXKVWGRSCIKI,V 
PYTSWREMFLFJ^RVRFDGVYISKTTYIRQGEQSbDGFYRAWHQ 
VEYYRYIRFFPTK3HVMMLTTPEEPQSIVPRLRTR 


6471 


1750 


293 


FFFDKMAAGGSGVGGKRSSKSDADSGFIJGLRPTSVDPALRRRRR 
GPRNKKRGWRRIAQBPIX5LEVDQFLEDVRLQERTSGGLLSEAPN 
BIOjFFVDTGSKEKGLTKKRTKVQKKSIXLKKPLRVDLIL 
VPAPKDVIJulQVPNAKiaJeRKEQLWEKIJUCQGELPP^VP^AQAR 
LI^SATRAKPGPQDTVERPPYDLWAEDNPLDRPIiVGQDE FFUB 
QTKKKGVKR PAR t^'KP SQAPAVEVAPAGASYWP3 FEDHQTLI*S 
AAHEVBLQRQKEAE KliERQIALPATEQAATQE STFXJFXCEGIJ^S 
ESDGEGEPGQGEGPEAGDABVCPTPARIATTBKKTEQQRRREKA 
VHRLRVOQAALRAAKLRHQELFRLRG I KAQVALRIAEIARRQRR 
RQARREAEADKPRRLGRLKYQAPDIDVQIiSS KI/TBSIiRTLKPEG 
NI IiRDRFKS FQKRlWJEPRBIO^KFKRKYB^/KtiVEKRl^FRBlQh 


6472 


3 


897 


SCGSDRAQWAMEFPFDVEAI>FPBRI'X r Vn^DQILLRPPARRPGTTTP 
ARVDLQQQIMTI IDBLGKASAKAQNI^SAPITSASRMQSNRHWY 
ILKDSS ARPAGKGAI IGFIKVGYKKLFVLDDRBAHNEVEPLCIIi 
DFYIHESVQRHGHGRELFQ YMLQKER VE PHQLAI DRPSQKIiLKF 
&NKHYNLBTTVPQVNNFVI FEGFFAHQHRPPAPSIiRATRHSRAA 
AVDPTPAAP ARKI*PPKRABGD I KPY3 SSDRKFLKVAVEPPWPLN 
RAPRRAXP PAHPP PRSSS LGNS PERGPLRPFVP 


6473 


22 


912 


SSAVEFVWEGBKMAAEPNKTBIQTIJ?KRLRAVPTNKAi^IXX^ 

NPS WAS ITYGVFLCIDCSGVHP^IiGVHLSFIP^TELDSNHNWFQ 

I^CMQVGGNAHATAFPRQHGCTANDANTKYNSRAAQMTO 

MSAALARHGTDIsWII^SSAVPNHSPEKKDSDFFTEHTQPPAH 

nAPATEPSGTQQPAPSTBSSGLAQPBHGPNTDIJLGTSPKASLEL 

KSSI IGKKKPAAAKKnLGAKKGLGAQKVSSQS PSEIERQAQVAE 

KLREOQAADAKKQAEESMVASMRLAYQELQIDR 


6474 


3 


462 


I^RQRQHPAAAPAVPVRCFTFCFTDIVIMPKRKSPENTEGKDGS" 
KVTKQBPTRRSARLSAKPAPPKPEPKPRKTSAKKEPGAKXSRGA 
KGKKBEKQEAGKEGrAPSRNGETKAEEIHISRSTVNVSTSRGTP 
PSTLSVKGQIETVRVKGTBN 


6475 


3 


ACT 


LORQRQHPAAAPAVFVRCPTFCFTDrVIMPKRKS PBNTEGKDGS 
KVTKQEPTRRSARLSAKPAPPKPBPKPRKTSAKKEPGAKISRGA 
KGKKEEKQEAGKEGTAPSENGETKAEE IHISRSTVMVSTSRGTP 
PSTLSVKGQIETVRVKGTBN 


6476 


106 


1090 


ARAMAQYKGTOREAGRAMHLLIUaiERQREQ 
KSQVDKRFSAHYOAVKAELKSSTVGLVTL^^ 
RQLAKRQHIiBEQRLQQERQRRQEQRRERKRKISCIiSFAIiDDLDD 
QADAAEARRAGNI/3KNPDVDTS FLPDRDREEEENRLREELRQKW 
EAQREKVKDEEMEVTFSYWDGSGHRRTVRVRKGNTAfQQFLKIGVL 
QGLRICDFLELRSAGVEQLMFIKEDLIIJ?HYHTFYDFI IARARGK 
SGPLFSFDVHDDVia»LSDATMEKDESHAGKVVLR5WYEKNK^ 
PASRWBAYDPEKKWDKYTIR 
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Amino acid segment containing signal peptide 
(A^Alanine, C»Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine , 
I»»Leuczne , M=Me t alanine , KfeAsparag ine , 
P=Proline, Q=Glut amine , R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
WaTryptophan, Y^Tyrosine, X^- Unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
\=»possible nucleotide insertion) 


6477 


227 


915 


LQGHLMG I MAAS RP L SRFWEMGKNI VCVGRNYADHVREI>IRS AVI* 
SEPVTjFLKPSTAYAPEGSPIIilPAYTRNIiHHBDE^GVVMGKRCR 
AVPEAAAMDYVGGYALCHjT)MTARDVQDECiaCKGLPW FTA 
S CP VSAFVP KBKI PDPHKLKLWLKVNGELRQEGETSSM I FS IPY 
I IS YVSKI ITLEEGDI ILTGTPKGVGPVKENDEIEAGIHGLVSM 
TFKVKKPHY 


6478 


2 


1495 


FVSSRILPBSLASSSASTIiEAMGRKBEDDCSSWKKQTTNIRKTF 

IFMEVl^SG^SEVFLVKQRLTGKLFALKCIKKSPAFRDSSLEN 

BIAVLKKIKHBNIVTLEDIYBS*1THYYLVMQLVSGGBI,FDRILE 

RGVYTEKDASLVIQOVLSAVKYLHENGIVHRDLKPENLLY^ 

ENSKIMITDFGLS KMEQNG IMSTACGTPGYVAPEVLAQKPYSKA 

VDCWS IGVITYILLGGYPPFYEBTESKLFEKt KEGYYBFBS PFW 

DDISESAKDFICHIiEKDPNBRYTCBKALSHPWIDGNTAIiHRDI 

YPSVSLQIQKNFAKSKWRQAFNAAAVVHHMRKL^^ 

EVENRPPBTO^ETSRPSSPEITITBAPVLDHSVAI*PAIiTQI.PC 

QI IGRRPT APGGRS LNCLVNGS IiHISSS LV PMHQG5 LAAGP CQCC 

SSCLNIGSKGKSSYCSBPTLIjKKANKJCQNPKSEVMVPVKASGSS 

HCRAGQTGVCLIM 


6479 


3 


949 


SCRGPGWK^AGWp^GAMEU^ALSLGBIJ^ 
YFIVSILYLKYEPGAVEI^RRIIPIASlflJCAMLHCFGSYIIADI^ 
U3BPLIDYFSNNSSILIASAWYLI FFCPIiDLFYKCVCFLPVKIj 
IFVAMKBVVRVRKIAVGIHHAHHHYHH^^ 
LMSNFEQLLRGVWKPETNE I LHMSFPTKASLYGAI LFTLQQTRW 
LPVSKASLIFIFTI*FP1VSCKVFLTATHSHSSPFJDAIiEGYICPVL 
FGSAO»DHHHDNHGGSIISGGGPGAQHSAMPAKSKKKI>SEGSRK 
KKAKKAD 


6480 


192 


514 


DFMSIYFPIHCPDYLRSAKMTEVMMNTQPMEEIGLSPRKDGLSY 
QIFPDPSDFDRCCKIjKDRLPSIWEPTEGEVESGELRWPPEEFIi 
VQEDKQDNCBETAKBNKEQ 


64 81 


110 


1131 


KSRMDLDWNMFVIAGGTLAI PIIAFVASFLLWPSALiIRI YYWY 
HRRTLGMQVRYVHHEDYQFCYS FRGRPGHKPS I LMLHGFSAHKD 
MWIiSVVKFLPKNLKLVCVDMPGR^ IDGQVKR IH 
QFVECIJCUJKKPFm.VGTS^GQVAGVYAAYYPSDVSStiWLVCP 
AGLGYSTDNQFVQRLKELOGSAAVEKJPIiIPSTPEEMSEMIjQLC 
SWP^IWPOXJir^IiVDVRIPHNNF^YRKI.FLBIVSBKSRYSliHQ 
NMDRTKVPTQIIWGKG5XJVIJ^SGaDMIiAKS IANCQVEtiLENCG 
RSWMERPRKTAKL 1 1 DFXASVHNTDNNKKIiD 


6482 


2517 


568 


EPVSKVSQSRRKAGVPTANI EESOAVEAAMANVPHABVCBKFQA 
ALAI^RVELHKNPEKEPYKSKYSARALLEEVKALLGPAPKDRDE 
RPEAEDGPGAGDHALGLPAEWEPEG PVAQRAVRLAVIBPHLGV 

LWS ERBB I ETAQA YLES S EALYNQYMKEVGS PPLDPTBR FLPBE 
BKLTEQBRS KRE^KVYTIINLYYIiAQVYQHLEMFKKAAHYCHSTL 
KRQLBHNAYHPIEWAINAATLSQFYINKLCFMEAl^CLSAAbP/I 
PGQTGKISATEDTPBAEGBVPEI.YHQRKGEIARCWIKYCLTLMQ 
NAQl*SMQDNIGEIiDTiDKX^BIiRALRKEEIJ)EEES IRXKAVQFGT 
GELCmiSAVEEKVSYI^PI^FEEARBiFIJ^HYVFBAKEFFQ 
IDG YVTDH I BVVQDHS ALFKGLAFFETDMERRCKMHKRR IAMLE 
PLTVDLNPOYYLLVNRQIQFEIAHAYYDMMDIiKVAIADRLRDPD 
SHI VKKlIWIiNKSALKYYQIjFLDSIiRDPNKVFPEHIGEDVIiRPA 
MLAKFRVARLYGKI I TADPKKELENLATSIiRHYKFIVDYCEKHP 
EAAQBIBVEItBLSKEMVSLLPTKMBRFRTKMALT 


6483 


3 


623 


NSULLCGLRARAPLSANGREiUiAMKQRliAEF 
PAASQGAG^PGEKAEAAATLKAAPGWIiKRFLVWKPRPASARAQP 
GLVQEAAQPCGSTSBTPHNTAIPLPSCWIX^SFL^^ 
LVLIXSI^VBLErciAYFVBSLFYWMYVGTRGPBEKKEGEKSAYS 
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Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E~ 
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P= Pro line, Q«Glut amine, R=Arginine, 
S=Serine, T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X -Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VFNPGCBAIQGTLTAEQLERELQLRPLAGR 


64B4 


201 


965 


QIAVKTKMSGIjRPGTQVDPEIELFVKAGSDGSSIGNCPFCQRLF 
MII^KGVKFNVTTVTWraK^ 

DPI KIKE FIiBQTLAPPRYPHI*SPKYK^FDVGCNIiFAKFSAYI K 
NTQKEANKNFE KSLLKE FKRLDDYLNTPLLDE IDPDS AEEPPVS 
PJfcLFliIXSDQLTLADCSLLPK^ 
RYIJINAYAREEFTHTCPKDKEI BinTANVAKQKS 


6485 


6 


1091 


FVDbVRRVBFLPCPDSQKLEKBOQSSEESMGSNSMHS I LKBDEE 
rip RUDin/T.T.YHRPR Q FI^TnMLVWHKHKKYPPWPAVVKSVRQRDK 
KA5VLY ZBGHMNPKMKG FTVSI^IiKHFOCKEKlQTLLNQAREDP 
NQDIGWCVSLITDYRVRLGOGSFAGSFLKYYAADISYFVRKSIQ 
QDVIjGTKLPQLSKGSPEEPWGCPLG^QPCRKMLPDRSRAARD 

CVETYI^BGQLDLWKnn^QGVYQWG^KVKJRT^DRIRFII^ 
VbliPBAI ICAISAGDEVDYKTAKEKYIKGPSLSYREKEIFONQL 
LEERNRRRR 


6486 


10 


581 


LVbQAGGAHLSP^RVTQG I YYMLAFSEMPKP PDYSELSDSLTLA 
ryTVOWOTDT HP&tTOMMWTn?nPMGWIGVGI»YLIJ\SAAAPYYVFB 
ISETYNRlAI^IQQHPEEPIiEGTTWTO 

I FLVP YI^MFTjPTj YSCTRADPKTVGYC 1 1 P I CLAVI CNRHQAFV 
KASNQISRLQLIDT 


■ 6487 


352 


863 


SPLKPLRGKMSVTLHTDVGDIKIBVFCERTPKTCENFLALCASN 
YYNGCIFHRNI KGFMVQTGD PTGTGRGGNS I WGKKFBDE YSBYIi 
KHNVRGWSMANNGPNTNGSQFFITYGKQPHLDMKYTVFGKVID 
GLETLDELBKLPVNEKTYRPLNDVHI KDlTIHANPFAQ 


6488 


878 


241 


TALQE FGT6GP PLSLRFALPSGTGRFKPIiPGARGPS WPPS PRVP 
mrppmt .VPVRT.WYnTiR KGItARHLSP IMLGKOIiEGIWHTS I WH 
KDBFFFGSGGISSCPPGGTLLGPPDSVVDVGSTEVTF^IFIiEYIj 
SSLGESIJ^GEAYNLFEHNCNTFSNEVAQFLTGRKIPSYITDrJ? 
SEVLSTPFGOALRPIiLDSIQIQPPGGSSVGRPWGQS 


6489 


1457 


375 


KVAKMATALSEKELDNEDYYSIJLNVTi 
YHPDKHRDPELKSOAKRLFNIjVHQAYEVLSDPQT^ 
GLBMBGWBVVBRRRTPABIREBFBPJL^REREERRLG^RTNPKG 
ISVGVDATDLFDRYDBEYEDVSGSSFPQI EINKMHISQS I EAPL 
TATDTAI LSGSI*STQNGNGGGS INFAI»RRVTSAKGWGEI*EFGAG 
DLOGPLFGIiKIiFRNI>TPRCFVTTNCAI»QFSSRG I RPGI/TTVLAR 
NI^KBnVGYIX^HCSSPIiU)VaRPHRNTPJU^ 
TWDAKraGARTPSTAWTSAAVKLREACLSGPGSGSHQIJ J U»TPR 
SKRRTGGG 


6490 


3 


1183 


HEAGCEVWIiG YGP RAAAAAAATVLFGGAG PTETM FVARS IAADH 
KDLIHIWSFDFHGRRMATCSSDQSVKVWDKSESGDWHC^ 
HSGS VWRVTWAHP E FGQVIiASCSFDRTftAVPTEE I VGESNDKLRG 
QSHWVKRTTLVPSRTSVTDVKFAPKHMGIJ4LATC3ADGIVRIYS 
APDVFmSQWSL01IEISCKI^(^CISWWPSSSRAHSPMIAVGSD 

dsspnamakvqi feynemtrkyakaetlhtvtdpvhdiafapnl 
grsfhi lai atkdvriftlkpvrkelts sggptkpe iktvaq fd 
nhnsqwrvsiwitg™lassgdixx^lwkanymdnwkctgil 
}«^gspvngssoggtcnpslgsnipsi^nsiingssagpj(hs 


6491 


3 


1183 


heagcbvwlgygpraaaaaaatvlfggagptetmfvars iaadh 
kdlihdvsfdfhgrrmatcssdqsvkvwdksesgdwhctasvikt 
hsgsvwrvtt^pefgqviascsfdrtaavweeivgesndklrg 
qshwvkrttlvdsrtsvtdvkfapk^^ 

APDVMNI£QWSI<JHErSCKLS(^CISWNPSSSRAHSPHIAVGSD 
BSSPNAMAKVQIFEYNENTRKYAKAETIjM^ 
GRSFHIIAIATKDVRIFTLKPVRKBLXSSGGPTKFB1HIVAQFD 
NR^SQVWRVSWNITGTVI^SGDIX^CVRliVrKAl^DNWKCT 
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S=Serine, T=Threonine, V=»Valine, 
W -Tryptophan, Y-Tyrosine, X-Un known, *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








KGNGSPVNGSSQQGTSNPSLGSNIPSLQNSLNGSSAGRKHS 


6492 


34 


2573 


IPFLKSCCCCCLFDFPPPPIJ3QVQEEBCEVBRVTEHGTPKPFRK 

PDSVAFCTSC^HDEQPKNDLETDPPNWQQLVSREVIXGLKPCEI 

KRQEVI NBLFYTKRAHVRTIiKVLDQVFYQRVS REGILS PSELRK 

IFSHLEDILQIJIIGLNBQMKAVRKRNBTSVIDQ^ 

PGBEKLKHAAATPX2SHQPFALEMIKSRQKKDSRFQTFVQI1ABSN 

PLC31RLQLKDI IPTQMQRLTKyPLIJADXIATYT^ 

AADHCRQILNYVNQAWEAENKQRLEDYQRRLDTSSIJCLSBYPW 

VEBLRNliDLTKKKMIHBGPLVWKVNRDKTIDLYTI^ 

QKQDDRIiVLRCHSKXLASTADSKHTFS PVTKLSTVLVRQVATDN 

KAr,FVlSMSDNGAQIYELVAQTVSEKTVWQDLICRMAASVKEQS 

TKPIPLPQSTPGBGDNDEEDPSKLKEEQHGISVTGLQSPDRDLG 

LESTLlSSKPQSHSL.STSGKSEVRDLFVAERQFAKKQHTDGTtiK 

EVGEDYQIAI PDSHIipVSEERWALDALRNLGIiLKQLLVQQLGLT 

BKSVQEDWQHFPRYRTASQGPQrrDSVIQNSENIKAYHSGEGHMP 

FRTGTGDIATCYSPRI'STKSFAPRDSVGIUAPQDSQASNILVMDH 

MIMTPF^PTMEPEGGUDDSGEHFFBAREAHSDEin>SEGIX3AVNK 

BEKDVNLRISGNYLILDGYDPVaESSTDEEVASSLTLQPMTGIP 

AVBS THQQQHS JKOTHSDGAI SPFTPEFLVQQRWGAMEYS CFE I 

QSPSSCADSQSQIMBYIHKIEADI^HLKKVEESYTILCQRIAGS 

ALTDKH3DKS 


6493 


557 


1147 


TPARMAYOGSSTSDCMSKTLDSASAHFAASAWSAPVPSRSEVA 
KEQNTGHNNTNGWQPSGTS KTLYS TNMALS SS PG I S AVQLVRl" 
VGHTTTNHLI PALCTSSPQTT,PMNNSCLTNAVHLNNVSWS PVN 
VHLNTRTSAPS PTALKLATVAASMDR VP KVTPS S AI S S IARBNH 
EPERLGLNGIABTTVAMEVT 


6494 


2425 


1052 


AVAGGARPCSTPSSPHRRCRRHRPRPLPRPPAAIMSASAVYVLD 
LKGKVI» I CRNYRGD VDMS EVEHFMP I LMEKEEEGMLS PI LAHGG 
VRFMWI KHNNLYLVATSKKNACVSI,VFSFLYKVVQVFSE Y FKEL 
EEBS IRnNFVTIYELLDELMDFGYPQTTDSKILQBYITQEGHKI, 
ETGAPR P PATVTNAVSWRSBG I KYRKNBVPLDVI BSVNLLVSAN 
GNVLRSBIVGSIKMRVFLSGWPBUo^LiroKVLFDNTG 
VELBDVKFHQCVPJLSRFENDRTISFI PPDGEFELMSYRIWTHVK 
PLIWIESVIEKHSHSRlEYMIKAK^FKRRSTANNVEIHIPVPlil 
DADSPKFKTTVGSVKNVPENS B I VWS I KS FPGGKByjjMRAHFGL 

PWVRYITQNGDYQLRTQ 


6495 


2425 


1052 


AVAGGARPCSTPSSPHRRCRRHR PRPLPRPPAAIKSASAVYVLD 
T.KTJKVT.TrRNYRf^nVT^^PrVTWTWPTT.MRlTKWra^rjciPTT XKf!P 
VRF^ IXHNl^YLVATSKKNACVSL VPS FJL YK WQVFSEYFKEL 
BEES IRDNFVI IYBLLDEX^FGYPQTTDSKXIjQEYI TQEGHKL 
BTGAPRP PATVTNAVS WRSEG I KYRKNBVFLDVIES VNLLVSAN 
GNVLRSEIVG3IKMRVFIjSGMPEIiRX<GIjNDKVLFUNTGR 
VELEDVKFHQCVRJjSRFKNDRTISFIPPIX?EFBXMSYR1^^ 

pliwibsviekhshsribymikaksqfkrrstannvbihipvpn 
dadspkfktt vgsvkwvpense ivw3 iks ppggkeylmrahfgl 
psveakdkegkpp i s vkfe i py fttsgiq vrylkiie ksg yqal 
pwvryitqngdyqlrtq 


6496 


247 


559 


LRAVSIiLPLQLVLPEYSIHSLFCIMFI^^BWLTI^^ 

HFWRYFHCPADSSELAYDPPVVMItM>TLSYCQKEAWCK^ 

SFFYYLYCMIYTLVSS 


6497 


1053 


352 


A^ICRLCPRRHl^PCGAKNGNGTEEDYNFWKVVLIGKSGV 
GKl'NLLSRFTRNEFSiroSRTTIGVEFSl'RTVMIjG'rAAVKAQIWD 
TAGI^RYRAITSAYYRGAVGAIiLVFDLTKIIQTYAVVERWIiKELY 
DHAEATIVVMLVGNKSDI^QAREVPTEEARMPABNNGLIiFIiETS 
ALD5TNVBLAFETVLKEX FAKVSKQRQNSIRTNAITLGSAQAGQ. 
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NO: 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(At-Alanine, C=Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, F~Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
LoLeucine, H=Methionine, N=Asparagine, 
p=proline, O^Glutamine, R-Arginine, 
S=Serine, T=Thxeonine , V= Valine, 
W ^Tryptophan, Y=Tyrosine, X=Unknown, *=-Stop 
Codon, /^possible nucleotide deletion, 
\=poesible nucleotide insertion) 








EPGPGBKRACCI3L 


6498 


2636 


272 


SLRLCPWGTHLAGPTTMRLSSLlJUaljRPAIi PL I LGLS LGCS LS h 
LRVSWIQGEGEDPCVBAVGERGGPQNPDSRARIiDQSDEDFKPH I 
VP YYRDPNKP YKKVLRTRY I QrTEIXSSRERLLVAVIjT^RATLSTL 
AVAVNRTVAHHFPRLLYPTGQIK5ARAPAGMQWSHGDERPAW 
SETI»RmHT»FGADYDWPPIMQDDTYVQAPRI^^ 
LYIX3RAEEFIGAGEQARYCHGGPGYI»I>SRSIiMjRIjRPHLD^^ 
DIL3ARPDBWIjGRCI>IDSLGVGCVSQH0GQQYRSFEIiAKNRDPE 
KEGSSAFLSAFAVHPVS EGTLMYJEIIiHKRESAIjEI>ERAYSE ieqi* 
QAQ IRNbTVLTPEGEAGIiSWPVGl.PAPFTFHSRFEVLGWDYPTE 
QKTFSCAIX^PKCPXQGASRADVG^ 

QKQRLIJ^YRRFDPARGMEYTI^LLLECOTQRGHRRALARRVSL 

IJlPI^RVEILPMPYVTBATRVQLVIiPLLVABAAAAPAFLEAFAA 

NVLEPREHALLTI^VYGPREGGRGAPDPPIiGVKAAAAEliERRY 

PGTRIJVV^VRAEAPSQVRIttDVVSKKB^ 

pBVWmCKMMAISGWQAFFPVHro^ 

DPPSPPGADPSRGAPIGGRFDRQASAEGCFYKADYLAARARLAG 

ELAGQEEEEALEGLEVMBVFLRFSGI^^ 

CS PRUS EO^YHRCRLSNLBGLGGRAQIJ^ T 


6499 


3 


2040 


SCSADTRPSGQAW PTVGLRAAAGAFRTGS PLALUPET PQVAOjP 
GHPPVRPQVSGGPGAMPDPAAHLPFFYGS ISRAEABEHLKLAGM 
AIXJLFLLRQCLRSLGGYVLSIjVHDV^ 

GKAHCG P7\ELCEFYSRDPDGLPCNLRKPCNRPSGLEPOPGVFDC 
LRDAK^DYVRG^IOjEGEAZiEQAl ISQAPQVEKIjIATTAHFJ?M 
PWYIISSLTREEAERKLYSGAQTDGKFbliRPRKEQGTYALSLIYG 
KTVYHYIjISQDKAGKYCI PEGTKFDTLWQLVEYLKLKADGLIYC 
LKEACPNS SASNASGAAAPTIi P AHPSTIVTKPQRR I DTLNSDGYT 
PEPARITSPDKPRPMPMDTSVYESPYSDPEEIjKDKXLFIjKRDNL 
LIADIEI^CGNFGSVRQGVYRMRKKQ I DVAI KVLKQGTEKADTE 
EMMREAQ imhqldnpyivrli GVC$AEA1^VMEMAGGGPL»HKF 
LVGKREE IPVSNVAEIiLHOVSMGMKYIjEBK^ 
VNRHYAKISDFGT^KAIjGADPSYYT7«SAGK3^LKWYAPB 
RKFSSRS0VWSYGVTMWBALSYGOKPX1OWKGPEVMAFIBQGKR 
MECPPECPPEI.Y ALMSDCW1 YKWEDRPDFI>TVEQRMRACYYSLA 
SKVBGPPGSTQKAEAACA 


6500 


1773 


726- 


TGPTHT^AnAWLVRSVTEWCANVRGWPCAAA^ 
MI£BSSSFI*KGVMIA3SIFCALITM^ 

QAPNKEDI LKISEDERMELSK3FRVYCI I LVKPKDVSLWAAVKB 
WTKHCDKAEFFSSENVKVFES INMDTNDMWLPWRKAYKYAFDK 
YRIX2YNW?FLARPTTFAXIBNLKYFI*LEa>PSQPFYLGHTI KSG 
DLEYVGMEGGIVIiSV3SMKPJ^Sl^IPKKCPEQGGMIWKISEI) 
KQIAVCi KYAGVFAENAEDADGKDVFNTJCSVGliS I KEAMTYHPK 
QWEGCCSDMAVTFl«3LTPNQbfB^7HMYGVYRIiRAFGPYFQ 


65*01 


1 


570 


LV5MSGGGXETPVGCEAAPGGGSKKRDSLGTAGSAHLIIKDLGE 

XMSKXtJUJLUuvJrVlvUlx iKir vivisi aDAAuuiuu uunumiiiAMu 

TNEimiPKauyiMRDSIiSQVIjQRI^ 

HSDKliVASBXQHMIiQWDNFMKE^ 

YAEMEKDLAKPSTF 


6502 


213 


1650 


IfUTTOP D P WAGRNRTAVLPDVS V FHREi) VGWWRSWLQQ S YQAVKE 
KS SEALB FMKRDLTE FTQWQHDTACT IAATAS WKB KLATEGS 
SGATEKMKKGLSDFLGVISDTFAPSPDKTIDCDVITriMGTPSGl' 
AEPYDGTKARJbYSI^SDPATYCNEPDGPPELFimWLSQFCLEEK 
KGEISBLLVGSPS IRALYTKMVPAAVSHSEFWHRYFYKVHQLEQ . 
EQARRDALKQRAEQS I SBEPGWEEEEEBLMGIS PI S PKBAKVPV 
AKISTFPEGEPGPQS PCEENLVTSVEPPAEVTPSBSSESISLVT 
Q I ANPATAPEARVLP KDLSQKLIjRASLBEQGLAVDVGETGPS PP 
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sequence 


Afflj.no tic id KE^cnnGnfc con h pi "i ni ncx fticm^il T*w%t~ 4 
(A-Alanine, C-Cyeteine, D=Aspartic Acid, E- 
Glutamic Acid, F=Pbenylalanine r G=Glycine, 
ifoHistidine, I=Isoleucine, K=Lysine, 
L-Leucine, K=Methlonine, N»Asparagine, 

Proline, Q=Glutamine, R=Arginine, 
S»Serine, T^Threonine , V=Valine, 
Wt=Tryptophan, Y=Tyrosine, X^unknown, *-Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








IHSKPLTPAGHTGGPEPi^PARVBTIiRBBAPTDLRVFELMSDSG 
Its l ^bWMtjKJ^SSTDISKDWEKDF DIJjPrrEEEVQMALSKVDASG 
EVSGPGGSEGSEPNGPGCESS PQPAQLS PQEGPCSCLR 


6503 


213 


1650 


AGNKPDPWAGRNKTAVLPDVSVFHREDVGWHRSWLO^SYQAVKE 
KSSEALBFMKRDLTBPTQfVVQHnTACrriAATASWKEKL^^ 
SGATRKM KKGLSDFLG VISDTFAPS PDKTIDCDVITXMGTPSGT 
AEPYDGTKARLYSLQSDPATYCNEPDGPPELFDAWLSQFCrjEEK 
KGEI SFiLVGSPSIIUa>yTKMVPAAVSHSEFWHRYFYKVHQLEQ 
EQARRDAIiKQRAEOSISEEPGWEEEEEKLMGISPISPKEAKVPV 
AKI STFFEGBPGPQS PCEKNLVTS VBPPABVTPSESSES ISLVT 
QIANPATAPEARVLPKDLSQKLLEASLEEQGIAVDVGETGPSPP 
IHS KPLTPAGHTGGPEPRP PAR VETI>RE EAPTDLRVFELNSDSG 
KSTPSNNGKKGSSTDI SEDWEKDFBLDMTBEBVQMAltSKVDASG 
BVSGPGGSEGSEPNGPGCESSPQPAQLSPQEGPCSCIA 


6504T 


2131 


1294 


GKVCTiVAHWVCLSILS PPPAGMKTPNAQEAEGO^TRAAAGRATG 
SANMTKiaCVSQKKQR<3RPSSQPaWIVGCRISHGWKEGDEPIT0 
WKGTVIjDQVPINPSIjYLVKYDG IIXTVYGIiELHRDERVLSIjKILS 
DRVASSHISDANLANTI IGKAVBHMFEGRHGS KDEWRGMVLAQA 
P I MKAWF^1TYKKDPVI> YM YQLLDDYKBGDLR IMPES SES P PTE 
REPGGVVDGLIGKHVEYTKEDGSKRIGMVIHQVKAKPSVYPIKP 
DDDFHI YVYDLVKKS 


*50S 


2131 


1294 


GKVCLVAH WVCL3 I LS P P P AJ31KT PNAQEAEGQQTRAAAGRATG 

SANMTKIOCNraQlC<QRGRPSS0PCRlJIVGCRISHGWtCE^ 

HKGTVLDQVP IN PS LYLVKYDGI DCVYGLELHPJDBRVLSLK ILS 

DRVASSHISDANLANTI IG KAVEHM FEGEHGS KDEWRGMVLAQA 

PlfUCAWFYITYEKDPVLYMYQLLDDYKEGDtRIMPESSBSPPTE 

REPGGVVDGXIGKHVEYTKEDGSKRIGMVTHQVEAKPSVYFIKP 

DDDFHIYVYDLVKKS 


*506 


1 


1350 


EVS P PTS CCLTVAVADPG VS EGFRG FGAGCEM PGRGRCPDCGS T 

E LVEDSHYS QSQLVCTDCX5 CVVTEG VIjTTTFSDEGin J?FTVTYS R 

S rGENEQVSRS QQRGIJWVRDLatVLQLPPl'FBDTAVAYYQQAY 

RHSGIRAARLQKKEVLVt^CVLITCRQFINWPLTMGAI CTLLYAD 

LDVFSSTYMQIVKIiGIiDVPSU3tJu^V^ 

PAKYVEDKE KMLSRTMQLVE LANBTWLVTGRH PLP VI TAATFLA 

WQSLQPADRLSCSIJUIFCXIJSHVDIjPYP 

EQIAWIJiVLRLDIO^VVKHIGDIJ^HROSLVRSAFRDGTABVET 
RBKBPI»GWGQGQGEG3VGNNSI^P^ 

PKRICPVPPVSTVTGDKNISDSEIEQYLRTPQBVRDFQRAQAAR 
QAATSVPNPP 


6507 


1878 


929 


RSHASRIiPEI^SGCIjVLQVQEl,VQHSGMEArVTI P I WQNKPHGA . 

ARSVVRRIGTNI^LKPCARASFBTL^ 

IAWIAADEEETYARVRSDTRPLRIITWKPSPLIVMQ^ 

GSBERLLALKKPALPAI^TTBLQDFJ^KiRSQIAKIVAADA 

ASLTPDF1>3PGSSNVSSPLPCFGSSFHSTTSFVISDITEETEVE 

VPELPSVPIiLCSASPECOCPEHKAACSSSEEDDCVSIvSKASSFA 








EDISRKGN 


6508 


862 


342 


WEARKRPQRWPSBRREVRVPPPHLQRGRSGLEPGTFRKMAAARP 

SIXJRVI^GSSVI^LCD^EKFRHNIAYFPOIVSVAARMLKNTT^ 

DliDRGLQVHVVVDACSSRSQVDRLV^^ 

I LGL VGDAVHPQ FKE X QKLIKE PAPDSGLLGLFQGQNS LLH 


6509 


2 


1053 


FVlWPRGGRKRRRQJVAVTQAATRASGTPSPRDGmiXjGiCLSVAN 
KAPGTEGQQQVHGE KKRAPAVPSAP PSYEEATSGEGM KAGAFPP 








RRVFTOKVYTILLIQI»LVTIiAVVALF^^ 
ASYAVF^ATYLTLACCSGPRPJIFPWNLILLTVFTLSMAYL 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A» Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Pheaylalanine, G-Glycine , 
H=Hietidine, I=Isoleucine , K=Lysine, 
L= Leu cine, M=Methion±ne, N*=Asparagine, 

Proline, Q^Glut amine, R^Arginine, 
S=Serine, T>=Threonine , V=Valine, 
W=Tryptophan, YVTyrosine, X^Unknown , *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








SSYYNTT5VLLCLGITALVCLSVTVFSF0TKFDBTSCQGVLFVI* 

LimjFFSGLILAILIiPFQYVPWI^VYAALGAGVPTIxPI^ 

LLMGNRRHSI^PEEYIFTSAI^IYI^IIYI 


6510 


37 


1156 


PCALDGCPQRGAVH PIiLSSAT^IJLAFIiKa^B^VLHIJ^VGFWVVS 
GLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWS QLVMLLBWW 
SCTBCTLFTDQATVERFGKBHAVI ILNHNFEIDFLCGWTMCERF 
GVLGSSKVliAKKBIJj YVPLI GWTWYFLE XVFCKRKWEEDRDTVV 
EGLRRXiSDYPE YMW FljLYCHGTRFTBTKlfl? VSMEVAAAKGLPVL 
KYHLLPRTRGFTTAVXCLiRGTVAAUYDVTLNFRGNKNPSIjI^I l 
YGKK¥EADMCVRRFPI*BDIPIJ>EKR 

NQKGMFPGEQFKPARRPWTIXNyi^WATIIJ,SPLFSFVLGVFAS 
GS PLLTLTFLGFVGAGNGHCR 


6511 


2541 


1425 


GEEQPLAAAPTBCLEQVIGGAGDPGTWAS FPS PLPGPAPLKGGK 

KYPDSKSVCIjRGCPKVTEISNVICCVTRLPEETKRQAVAX IFTDD 

SARTFTCDSKI^ABBWYKTI^VEa/^RLMDISLCffi 

QCEQTDR Fl'TVFLIJPCPNIiDVYGB CKLQI THENI YLWD IHNPRVK 

L VS WPLCSLRRYGR1)ATRPTFHAGRMCIIAGEGLYTFXJTQEGBQI 

YQRVHSATLAIAEQEKRVLLEMBKNVKLLNK^ 

^PRSAYWHHITGSQNIARASSYAGEGYGAAQASSETDIiLNRFI 

tXKPKPSQGDSSEAKTPSQ 


6512 


159 


907 


KJKKS1WFPLSRSLRVAS GRSCKIjGHGGYTGSGPGFGK PRDSGA 
EVPSGSGRATGCKRGGVRGARQGPJVPGSSIWRKEPRMVCTRKTK 
TLVSTCVILSGMTNI ICLLYVGWVTNYIASVYVRGQEPAPDKKI, 

LFAHWGQELSPEGRRVALKQFQYYGYKAYLSDRLPLDRP 


6513 


2 


756 


FVSPEPGFSLAQLNLIWQLTiyrKQI.VHSFAKGQDOGSAYANRTA 
LFPDLIiAQGNASLRLQRVRVADEGS FTCFVS IEDFGSAAVS LQV 
AAPYSKPSMTLEPNKDLRPGDTVT I TCSSYQGYPEAEVFWQDGQ 
GVPLTGNVTTSQMANEQGLFDVHS I IjRWLGANGT YS CLVRN P V 
LQQDAHSSVTI TPQRS PTGAVEVQVPEDPVVALVGTDATLRCS F 
SPEPGFS LAQLNL I WQLTDTKQLVHS FAEGQDQGSAYANRTALF 
PDLLAQGNAS LRLQRVRVADEGS FTCFVB IRDFGSAAVSLQVAA 
PYSKPSMTI^NXDLRPGDT^ 
PLTGNVTTSO^iANEO<3IiFl)VHSII*RVVIiGANGTYS(^ 
QDAHSSVTITPORSPTGAVBVQVPEDPWALVGTDATIjRCSFS P 
EPGFSIAQLNLIWQIiTDTROLVHSFTEGR 


6514 


9B5 


302 


VGIPGPTISSAAEMBDr»I*DLOEEliRYSLATSRAKMGRRAQQESA 
QAKNHI^GKNfiSLTLTGETSSAKLPRCRQGGWAGDSVKASKFRR 
KASEBI EDFRLRPQSLNGSDYGGDI PI I PDLBEVQEEDFVLQVA 
APPS IQ I KRVMTYRDEjDNDLMKYSAIQTLDGE I DLKLLTKVLAP 
EHEVREJiNPSKQDDV^DWDHIiFTEVSSBVLl^TOPrjOl^ffiDP 
AGQARHT 


6515 


1345 


305 


GRVGSRRRGAAVPGGCGAGSTQLEVSASASCX5ALGSADMNPIW 
VHGGGAGPI S KDRKERVHOGtlVRAATVGYGIIiREGGSAVDAVEG 
AWALEDDPEFNAGOGSSVLNTNGEVEMDAS IMDGKDLSAG AVE A 
VQCIANFI KLARLVHEKTPHCPLTDQGAAQFAAAMGVPBXPGBK 
LVTRRNKKR LEKEKHK KGAQKTDCX^KNTjGTVGAVAT .DCKGNVAY 
AT3TGGI VNKMVGRVGDS PCLGAGXJYADNDIGAVSTTGHGES I L 
KVNl^U^TLFnrB(^7:<TVKEi^AJ)I>SU?YMKSRVKGI>GGLIVVSK 
TGDWVAKWTSTSMPWAAAIOXiKLHFGI DPDDTTITDLP 


6516 


1 


1402 


FRRLRYLGQDATAAARDLRTRGLOGYCPSATARQQVLVSALQQL 

KGRRSElHlNFJ^QBMPYSTNKELII^I^^VGTAGISLIXLWYHKVR 

KPGIAMKLPBPliSIiGNTFl&S ITLQDB 

LBKXNBLLTNMBBLKEBIRFLKEAIPKIjBEYIQPB^ 

I S PQHRARKRRLPTI QS SATSNS SEEAES EGG YITANTDTEEQ S 
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NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{/UAlanine, (^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Bistidine, I=Isoleucine, K=liysine, 
L=leucine, M-Methionlne , N-Asparagine, 
P=Proliue, Q=Glutaraine, R=Arginine f 
S=Serine, T=Threonine, v~Valine, 
H=Tryptophan, Y=Tyrosine, X=Unknovn, *=Stop 
Cod an, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FPVPKAFNTRVEELNLDVXJjQKVDHLRMSESGKSESFEIIiliRDHK 
EXPRDEIEFMWRFARAYGDMYEIjSTNTQHKKHYANIGKTLS era 
INRAPMKGHCHLVTCAVIjCG™^ 

1 KLLPEK PFLYYLKGRYCYTVS KLS WI EBCKMAATLFGKI FS STV 
QBALHNFLKAEELCPGYSNPNYMYI^OTTDIiEENQKM^FC^ 
AIJXPTVTKEDKBAQKEMQKIMTSLKR 


6517 


3 


1414 


GRVWGraSSLNAWVYVRGHABDYERWQRCGAR^ 
KAOGHBLGASRYRGADGP IjRVSRGKTNHP LHCAFXjEATQQAGYP 
LTEDf^FQQECFGWMDMTIHEGKRWSAACAYLHPALSRTNLKA 
EAETLVSRVLFEGTRAVGVEYVKNGO^HPJ^YASKKVILSGGAIN 
SPQLLW.SGIGNADDLKKLGIPVVCHL 

ACTRPITIJlSAQKPI^KVCIGLEWLVnCFTOEGATAHLETGGFIR 

SQPGVPHPDIQFHFLPSQVIDHGRVPTQQEAYQVHVGPMRGTSV 

GWLKI^ANPQDHPVIQPNYLSrBTDIEBFRtiCVKLTRBIFAQE 

ALAPFRGIOMPGSHIO^DKBIUJAFVRAIM 

PSDPTAVVDPQTRVLGVElCLRVVimSIMPSMVSGl^APTIMIA 

BKAADXIKGQPAIiWDKDVPVYKPRTLATQR 


6518 


242 


1098 


PAWNPGS BPRTRVRPRARS FPLPPPRAPRRRRHRLLRAVPGPSR 

RHRCRRRAPPPPST^DAGSBRSKAPSI»PPRCPa5PWGSSKT^ 

IXSKCFADFQKKQPDDDSAPSTSNSQSDLFSEETTSEWWNTSIT 

TPTI^PSG^PLPTBLNVTSPSKEECGPCTDTAHVSLITPTKRSC 

GTDSQSE*n3^PVKRPRIiIJ3NTERSEOT^ 

KIiF^VCXJEIXSSCRCGYVFCMLHRLPEaHDCT 

KMVKIiDRKVGRS CQR IGEGCS 


6519 


3 


1113 


ERKMAEPPSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLS 

AXKVRTKEKKAPRR VNGEGGSGGNS RQLQPPAAPS PQS YGS PAS 

WSFAPLSAAPSPSSSRSSFSPSAGTAVPSSASASLSQPGPRKLL 

VPPTI*LHAQPHII1jIiLPAAAAAASANAKSRRPKEKREKEKRRHGL 

GGftREAGGASREBNGEVKPIjpRDKIKDKIKERDKEKERfiKKKHK 

VMNBIKKEareEVKILLKSGKEKPKTHIE^ 

KNEKRKRPKMYSKSIQTICSGLLTDVEDQA71KGILJJDNIJKDYVG 

KtttOTWm>SKIPEWSHFPFVSI^Pl^ 

HIEHQPNGGASVrHCLQ 


6520 


3 


1113 


ERKMAEPPSPVHCVAAAAPTATV8EKEPFGKIiQL33RDPPGSLS 
AKKVRTEE KKAPRRVNGBGG SGGNSRQLQPPAAPS PQSYGSPAS 
HSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASbSQPGPRKLti 
VPPTLLHAQPiniLLLPAAATiAASANAKSRRPKBKRKKERRRFIGL 
GGARBAGGASRBENGEVKPLPRDKI PO>KI KERDKEKEREKKKHK 
VMBffilKKBNCTVKIIilJCSGKEKPrcm^ 

ENEKRKRPKMYSKS IQTlCSGI*LTDVBDQ7yUCGlI»WDNIKDYVG 
KKI^KNYDSKXPENSEFPFVSLKEPRVQim.RRLDTLEPKQLI 
HIRHQPKGGASVIHCLQ 


6521 


1B4 


1798 


KLFKyATirrSG^ELVIIPKALPLl\^QLIHADKIiGEKVSDSTMP 

Tniwwiwpwftiyiii iiMfgtfr 7»D#^wi?l?irr**FOT^T<SOt71?Q\/C*T r in7CrtI?Ml? 
I KRTVW o T KKTP r lw J\Jj/YCWjr.C &EtJ\tr IS trU 1 ii i> c.K^> v o 1 ViSJsyiSWc. 

tppatsseabqpkgepei^ekebnk^seetkkdekdqskkkekk: 
vxktipswati£asqiju^k(^pf^sprpkmmiltbaikac 

FG^CSGAS WAIPJCYI IPJCYPSLELEl^GYLLKO^^ 
KQ\H03KGA5GSPVWQKeRKTPQKSRlTOKNRSSAVDPEPQVKia 
DVLPIAPTRItCEPKEAS YSLIRJCYVSQYYPXtiRVD iRPQIiLKNA 
LQRAVERGQLEQITGKGASGTFQIiKKSGEKPIiLGGSLMEYAILS 
AIAAMNBPKTCSTTAIiKKxVIiEiniPCTNSNYQMHLL^ 

kngwmkqisgkgfsgtfqlcfpyypspgvlfpkkepddsrdede 
dedessefj^edbepppkrriqkktpakspgkaasvkqrgskpa 
pkvsa^rgkarplpkkappkaktpaketrpsstvxkkpsggss 

KKPATSARKE 


6522 


1042 


391 


NKWIAPSPRSHRTPESGRVLSLFRLPPPGMAI^^ 
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Predicted 
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nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, C^Cyeteina, D=sAspartic Acid, E= 
Glutamic Acid, F=»Phenylalanine, G^Glycine, 
H=Histidine, ralsoleucine, K=Lysine, 
LaLeucine, M=Methionine, N-Asparagine , 
P=Proline, Q=*31utan»ine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Trypt ophan , Y^Tyrosine, X^Onknown , *«stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECLDY YGMLS LHRM FE WGGQLTECELE LLAFLLDEAPGAAGGL 
S RARSG LKbljbELERRGQCDE SI^RLLGQLLRVLARHDLLPHliA 
RKRRRPVS PERYSYGTSSSSKRTEGSCRRRRQSSSSANSQQGSP 
PTKRQRRSRGRPSGGARRRRRGPQPHPSSSQSPPDLPUCAK 


6523 


2 


1&97 " 


ASCQTRRRTAAIiDSGERIAGRRSPlALAMASNFNDIVKQGYVKI 
RSRKLGI FRRCWLVFKKASSKGPRRIjEKFPDEKAAYPRNFHKVT 
ELHNIKNI TRLPRETKKHAVAI IPHDBTSKTFACESELEAKEWC 
KHLCMECLGTRLND I SLGEPDLLRAGVQREQNERFNVYTJ^PTPN 
IiDIYGEClllQITHENIYLWDIHNAKVKLVMWPr^SljRRY 
WFTFESGRKCDTGEGLFTFQTREGEM I YQKVHSATLAIAEQHER 
LMLEME QKARIKJTSIiTEPMTLS KB I SLPRS AYWHH I TRQNSVGE 
IYSI^NHENRHSDLTGKSCKTSENRFLEENAPLVMYGITHHLF 
MDTSTCKWHDLE 


6524 


2 


1097 


ASCX2TRf^TAAtJ3SGBRrAGRRSP IAIiAMASNFNDIVKQGYVKI 
RSRKIiGIFRRC^VFKKASSRGPRRIjEKFPDBKAAYTRNFHKVT 
KLiHN I KNI TRL PRETKKHAVAI I PHI5ETSKTFACB SELEAEEWC 
KHLC^CLGTRIjND I SLGEPULLAAGVQREQNERFNVY 
LDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDST 
WFTFESGRMCiyi^GLFTFQTREGEmYQKraSATLAIAEQHER 
LMLEMBQKARLQTSLTE PMTLSKC I SI>PRSAYWHH I TRQNSVGE 
IYSLQG^BNRHSDLTGKSCKTSENRPI^BWAPLVMYGITHffl^P 
MDTSTCKWHDIiE 


6525 


1 


1859 


GESPFSEBESIEFNPSSSGRSARTVSSNSFCSDDTGWPSSQSVS 
PVKTPSDAGNSPIGFCPGSDEGFTRKKCTIGMVGEGSIQSSRYK 
KBSKSGLVKPGSEAJDFSSSSSTGSISAPEVHMSTAGSKRSSSSR 
NRGPHGRSNGASSHKPGSSPSSPREKDLLSMLCRNQI^PVNIIIP 
SYAPSSPSSSNSGSYKGSDCSPIMRRSGRYMSCX3ENHGVRPPNP 
EQYLTPrOQKEVTVRHT,KTKXKBSBRRiaBRE3EIVELKSQLAR 
MRKDW I EEE CIIRVEAQLALKEARKE I KQLKQVT ETMRSS LADKD 
KGIQKYFVDINIO^KKI,ESl^SMKMAHSGSIiRDEIiCl>DFPCDS 
PEKS LTLNPPLDTMAIXJIiSLEBQVTGEGADREIiLVGDS IANSTD 
LFDBIVTATTTESGTII^EIjVHSTPGANVI^LPIVM^EEGSVVV 

eravqtdwpyspaiseliqsvlqklqdpcpsslaspdesepds 
MBS fpeslsalwdltprmpnsai li*s pvetp yanveaevhanr 

I^EU)FAAC\reERIJX3VIPr^GGVVRQYWSSSFLTO 

PWPTVLWAFSTQRGGTDPVYITCGA^ 

IKT 


6526 


2 


2034 


sgragepeewrgrqiidsketwipfnsedsqqleeayssgkgcn 
gr wp tixsgrydvhiigbrmryavywdelaskvrrctwfykgdkd 

NKYVPYSESFSQVI^BTYMrAVTrJDEWKlCKLESPNPJSlI ILIINP 
KLMVHYQPVAGS DDWGST PMEQGRPRTVKRGVEN ISVDI HCGEP 
LQIDHLVFWHG IGPACDLRFRSI VQCVNDFRSVSU^QTHFK 
KAQENQQIGRVE PLPVNHHS PLHSTGVDVDLQRITLPSINRLRH 
FTNITriLDVFFYNSPTYCOTIVDTVASKMNRIYTLFI^^ 

mreTifuoT/'QT tt ttt\tt Tvtni/ncT yi^TT^owttyoT ww~*nLms\/-*Y*, 
\jGvo XAJL»}lblitjo lii LitlJl L> x NUiUJbiA^lJlUoKiuJoijNIVMDQGD 

TPTLEBDLKKLQLSEFFDI FEKEKVDKEALALCTDRDliQE I G I P 

LGPRKKILNYFSTRKNSMGIKRPAPQPASGANIPKESEFCSSSN 

TRNGDYLDVGI GQVSVKY PRL. I YKPE I FFAFGS P IGM FLTVRGL 

KRIDPNYRFPTCKGFFNIYHPFDPVAYRIEPMVVPGVEFBPMLI 

PHH^RKRMHIjBI4{KGLTRMSMDLKNNljLGSXiRMANKS FTRAP Y 

PAIXJASErPEETBABPBSTSEKPSDVNTBBTSVAVKEKVLP INV 

GMLNGGQRIDYVLQEKPIES FNEYLFALQSKXCYWESEDTVLIiV 

LKEIYQTQGIFLDQPIO 


6527 


1 


922 


GWVPLLSRILPSDACKIYKQGINIRIJXmiDFTDMKCQRGDLS 
FIFNGDAAPSF^FVVLDWEQKVYQRIHHEESEMETEEEVDILMS 
SDI YSATLSTKS IS FTRAQTGWLFliBCKTERVGNFIiADFYLVNG 
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Glutamic Acid, F^Phenylalanine, G-Glycine, 
H^Histidine, X^Isoleucine, K=Lysine, 
L^. Leucine, M=*Methionine, N=Asparagine , 
P= Proline, Q=Glut amine, R^Arginine, 
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W=Tryptophan, Y=Tyrosine , X=Unknown, *=Stop 
Codon, /=possifcle nucleotide deletion, 
\=poseible nucleotide insertion) 








LVLESRKRRBHLSBBDILRNKAIMBSLSKGQNIMEQNFEP1RRQ 
SLTPPPQNTITW2EyiSAENGKAPHI»GRELVCKESKKTFKATIA 
MSQBFPLGIELLLfTVLBWAPFKHFlillC^ 

D I PVF PTITATVTFQK FR YDS FDG S I FTI PDD YKEDPSRFPDL 


6528 


1 


1073 


liTGPAAAEPRCAAimGMiaRAIfGRRKGVWLRL^ 
IPFLIKLCPGIQAIOjIFLNFVRVPYFIDLK^ 
LQPEEDVTIGVWHTVPAVWWKNflQGKDQMWYEDAIJSSSHP^ II>Y 
LHGNAGTRGGDHR VBLY KVL3S LGYHWTFDYRGWGDSVGTPS E 
RGMTYDAI^HVFDWIKARSGDNPVYIWGHSL^^ 
RETPPDALII^S?FTNIREFJVKSHPFSVXYRYFI>GFDWFFLDPI 
TSSGXKFANDENVKHISCTLLILHABDDPVVPFQLGRXI.YS IAA 
PARSFRDFKVQFVPraSDLGYRHKYTYKSPFiPRILRBFLGKSE 
PBHQH 


6529 


363 


2215 


n3IRYNKIGVVKTMSCX^EFVET]^IGYPKAr)NLNGEDFDWI.F 
EGVBDESFI^WFCGNV^imiSEREIiBAFSII^KSG^Il^GA 
AT 1 DRALKTCKTSDLKTPRLDDK^BKI^DFAOTIJjKLKNT ) KIQR 
RNKGQLMASVTSHKSLRLNAKEEBATKKL^^ 
NELQALTDEVTQLJ'&'IF FRHSNLGQGTNPLVFLSQ FSLEKYLSQE 
EQSTAALTLYTKKQFFQGIHEVVESSNKSQFFNFIjKIQTPS XCD 
NQE I LEERRLBMARLQLAY I CAQHQLIHLKASNSSMKSS I KWAE 
ESIJ{SLT5KAVDKENIiDAKISSLTSEXMKIjEXEVTQXKDR$I>PA 
WRENAQLLNM P VVKGDFDLQIAKQDYYTARQE L VLNQL I KQKA 
SFBLLQI>SYEIFJJ^KHRDIYRQIjENLtVQEIiSQSNMMLYXQLEML 
TDPSVSQQINPRNT I DTKD YS THRLYQVLEGBNKKXELFLTHGN 
LBEVABKLKQNI 9LVQDQLAVSAQEHS FFLSKRKKDVDMLCDTL 
YQGGNQLI,LSMELTEQFHKVESCffJ«CUmi^ 
TLANimLH^BREFYVY FLKDEDYLKDI VENLETQS KI KAVS LS 
D 


6530 


128 


2986 


GAAHHGAIVQVHPLLPGSSTIMIHDLCLVFPAPAKAVVYVSDIQ 
ELYI RWDKVE I GKTVKAYVRVLDLHKKP FLAK YFP FMDLKLR A 
ASPIITLV7^LDEALDNYTITFLIRGVAIGQTSLTASVTNKAGQR 
INSAPQQIBVFPPFRLMPRKVTlilGATMQVTSBGGPQPQSNIL 
FS ISNESV7u^VSAAGLVQGLAIGNGTVSGLVQAVDAETGKVVI I 
SQDLVQ VEVLLLRAVRIR AP IMRMRTGTQ MP I YVTG I TNI IQN P F 
S FGNAVPGLTFHWS VTKRDVLDLRGRIffi 

LGRVKGRTGLRAVVKAVDPTSGQLYGLARBLSDB I QVQVFEKLQ 
lAiTOBIEAEQII^PNSYIKLQTNRIXyUVSI^ 

YLRVSMS ?VIJITC2NKFJu^VAVPLGMTVTFTVIIFim 
SSN^FATNRDDFVQIGKGPTNNTCVVRTVSVGLTLLRVWDAKH 
PGI^DFMPIiPVLQAI S PEL^GAMWGDVLCLATVLTSLKGLSGT 
WSSSANS ILHXDPKTGWVARAVGSVTVYYEVAGHLRTYKEVVV 
SVPC^IMARHLHPIQTSFQBATASKVIVAVGDRSSNLRGECTPT 
QREVIQALHPETLI SCQSQFKPAVFDFPSQDVFTVEPQFDTALG 
QYFCS ITMHRLTDKQRKHI^SMKKTALVVSAS LSSSHFSTKQVGA 
EVPFS PGLFADO^E ILLSNHYTSSE IRVFGAPEVLBNLEVKSG3 
PAVLAFAKEKBFGWPSFITYTVGVLDPAAGSQGPLSTTLTFS3P 
VTNQAIAI PVTVAFVVDRRGPGPYGASLFQHFLDSYQVMFFTLF 
ALLAGTAVMI IA YHTVCTPRDLAVP AALT PRASPGHS PHYFAAS 
SPTSPNALPPARKASPPSGLWSPAYASH 




845 


1425 


PSASIPPSASPDPVPDIRTCHFCLVEDPSVGCISGSEKCTISSS 
SLCMV IT I YYDVKVRFJ VRGCGQY I S YRCQEKRNTYFAE YWYOA 
QCCQYDYCNSWSSPQLQSSLPEPHDRPLALPLSDSQIQWFYQAL 
NLSLPLPNFHAGTEPDGLDPMVTLS LNLGLSFAELRRMYLFLNS 
SGLLVLPQAGLLTPHPS 


6532 


2 


954 


AAGPPSEWNQDS LFPE PEPGPAPQVL1GPQGPGLIKGVAPPTL 
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ITDSTGTHLVLTVTHKNAHSPGLSRGSPQQPSSQPGSPAPAPSA 

QMDLBHPIjQPI.FGTPTSLLKKEPPGYEEAMSQQPKQQENGSSSQ 

QMDDLFDILIQSGBISADFKEPPSLPGKKKPSPKTVCWSPLAAQ 

PSPSAELPQAAPPPPGSPSLPGSLBDFLBSSTGLPLLTSGHDGP 

BPI£I,IDDI^QMLSSTAIIJ>HPPSPM1^^ 

DIJUX5HLDSMDWI^1^SGGPVI£IAPLSTT^ 

IiQLHWDSCL 


6533 


1798 

- 


373 


STISWI^VEPPRRSSGVGAARLRPPGGSRPURARACVIiAlA^ 
ALLEJRjNNADSMSAHSMLCERIAIAKKLIKRAKSLSRSRKGCIKG 
GAKIXrSKbKTvEI.KFIOK^KAGKVAIK^SHLQSTNLTIILRAIVES 
AENLEEWSTVLHVEGY/nJTI^ 

KAEALHNI WIX5RGQ YGDKS 1 1 EQAEDFLQASHQQPVQ YSNPH 1 1 
FAFYNSVSSPMAEKLKEMGISVRGDWAVNALIJ5HPEBLQPSBS 
ESDDEGPELLQVTRVDRKHILASVAFPTBIKVWCKRVNLDITT 
LITVVSALSYGGCHFIPKBKVLTKQAEQBRKEQVLPQLEAFMKD 
KELPAOSSAVXDFQSIIJDTI^PGKRERATVLIKR INWPDQPS 
ERAIiRLVASSKIHSRSLTI FGTT3DTIiKAITMTANSGPVRAANNQ 
GVKFSVFIHQPRAIjTESKEAIiATPLPKDYTTDSEH 


£534 


47 


596 


IOV!n^ISAA^VVI^CC«VSPAW^ 
EKSLQCFPCSAMLLEI»IPL1X5IHFVI>RTARA0SVT^ 
EGASU3LRCN YS YGATP YIiPWMERTVEEAPI EiLVCLKPWRVAS S 
LEKKEKEDESFQI^IiG3RYNVTiKAHCIiLPLIRWLTSGDSLLSAQ 
PHCPQGL 


6535 


250 


964 


LIKTFFRDVAIQRDIjLPKBKNLETLtTIAPliEIDKAPSSHARLS 
ADATTJjTSGTTATVALIiRDG Z EL WAS VGDSRAILCRKGKPMKL 
TIDHTPBRKDEKERI KKCGGFVAWNSLGQPHVNGROLAMTRS IGD 
LDLKTSGVIABPETKRIKLHHADDSFIiVLTTDGINFMVNSQEIW 
DFVNQCHDPMEAAHAVTEOAI QYGTRDNSTAVWPFGAWGKYKN 
SBItfFSFSRS FASSGRWA 


6536 


242 


1174 


5I»VKEMTNQ YGILFKQEQA1JDDAIWS VAWGTOTCKENS BTWTGS 

LDDLVKVWKWRDBRLDIiQWSLEGHQLGVVSVDISHTLPIAASSS 

LDAH I RJjWDLKNGKQ I KS I DAGPVUAWTIaAFSPDSQ YIATGTHV 

GKVNI FGVESGKKEYSIjyroGKFII^IAYSPIXSKYLASGAIDGI 

INIFDIATGKLLHTIJ2GHAMPIRSLTFSPDSQLLVTASDDGYIK 

lYDVOJJAlWAGTI^HASWVLNVAFCT 

DVGTRTCVHTFFDHQDQVWGVKYl^ 

PI 


6537 


1638 


921 


WRFNPPPTQGPPPSLVYRPDVDPEVAKDKASFRNYTSGPIXDRV 
PTTYIOjMHTHQTVDFVRSKHAQFGGFS 

IALFGEPQWAWGDT FPVGCRPOASVVFCBSTFQDNPDLQDPRY 
STEI^YQPHCGLDRVLMSWGHDGEARGGQWGGGGRWGTVGGGG 
AEAVPAGDTLS PQSTCTR 


6538 


3345 


2412 


P YIiYDFLDAL I TCQTAPE EAF I IGjIXSLAC^LTEQIjRJRLTKQVQK 
ARHNRDDEAIICKAVNEYDB1MEKY1PVUJAQAOT 
EKI PRRSVBFCNDHDVWKLNVAHVL 
KHYDNII^SAIVLANLCVSYIMTSQNEKAEEI^^ 

syddpnrkmyhlcivnlvigtlycakgnyefgisrvikslepyn 

KKLGTDTWYYAKRCFLSLLKNHS 

BLYGTNIPAVIEQPLEEERMHVGKNTVTDBSRQLKAIj I YE I IGW 
NK 


6539 


218 


339 


flgaasphphfsslaphpdqpeftpvojjeleamelwgpgv 


654Q 


3 


391 


LB^WmJ^PEDAMAECPT^ 

kqhawlpltieikdrlqlrvijurredwlgrpmtptqigpsixp 

I ^QLYPIXSRYRSSDSSFHRJCjVYHI KIDGVBDMLLBLLPDD 
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Codon, /^possible nucleotide deletion, 
\=po3sible nucleotide insertion) 


6541 


1165 


536 


RTIiVQRRI LMIjLRKPARGRDLRGRGRGTPRGGRKGLLiPTPDB F P 
RFEGGRKPDSWDGNRBPGPGHKHFRDTPR PDHPPHDGHSPASRE 
RSSSLQGMDMASLPPRKRPWHDGPGTSEHREMKAPGGPSEDRGG 
KGRGGPGPAQRVPKSGRSSSLDGEHfflXSYHRDEPFGGPPGSGTP 
SRGGRSGSNWGRGSNMNSGPPRRGASRGGGRGR 




3 


3775 


SWPRGRGETGGHPGALRTRTMQKSVRYlfBGHALYr^FIiARKEGT 
KRGFLS KKTAEASRWHEKWFALYQNVLFYFEGEQSCRPAGMYIiIj 
EGCSCBRTPAPPRAGAGQGGVRDAI»DKQYYFT\a^GHEGQKPI»E 
LRCBEKQIXJKSWMEAIHQASYADILIEREVLMQKYIHLVQIVKT 

eki aanqlrhqledodtei BRLKSH I ialnktkermrpyqsnqe 

DEDPDIKKIKKVQSFMRGWJW3RRKWKTIVQDYICSPHAESMRKR 
NQIVFTMVEAESBYWQLYILVHGFIjRPLRMAASSKKPPISHDD 
VSSIFIjNSETIMFTiHEIFHQGLKARIANWPTLILADLFDILLPM 
I^IYQEPVRNHQYSJX2VI^CKQNRDFDKIiLKQYEANPACEGRM 
IjETFIjTYPMFQ I PRY I XTLHEIjLAHTPHKHVERKSOSFAKSKr.B 
EI^RVMHDEVSDTENIRKIILAIERMI VEGCDI IiLDTSQTFIRQG 
SLIQVPSVERGKI^KVRI^SI^LKKI^KRC^FI^KIIFLICTRS 
SGGKLHLIJCraGVI>SLI DCTL IE E PDASDDDS KGSGQVFGHU)F 
KI VVEPPIJRAAFTVVLLAPSRQKKAAWMSDI SQCVDNIRCNGLM 
TIVFEENSKVTVPHMIKSDARLHKDDTDICFSKTI^SCrKVPQIR 
Y7VSVERLIJ8Rl,TDI^FLSIDFLNTFlJn^IFTTAAVVLGKLSD 
IYKR PFTS I P VRSLBLPFATSQNNRGEHIiVDG KS PRLCRKFSS P 
PPLAVSRTSS PVRARKkSIiTSPLNS KIGALDLTTSSSPTTTTQS 
PAASPPPHTGQIPLDLSRGLSSPEQSPGTVEENVDNPRVDLCNK 
LKRSIQKAVl^SAPADRAGVESSPAADTTELSPCRSPSTPRHIJR 
YRQPGGQTADNAHCSVS PASAFAIATAAAGHGS PPGFNNTERTC 
DKJBFI IRRTATTOVLN\n,PJlWVSKHAQDFEIiNNE 
EVI»RDPDIiPQEPJCAAANIIJ4AI£QDI^DD^ 
MKAECFESLSAMELAEQ I TliLDI IVI FR3 IPYEEFLGQGWMKIiDK 
NERTPYIMKTSQHFNDMSNLVASQIMNYA0VSSRANAIEKWVAV 
ADICRCLHNYNGVIiEITSAIiNRSAIYRLKKTWAK^SKQ'™ 
KLQKTVS SEGRFTCNIjRETI»KNC!N P P AVPYLGMYLTDLAFI EEGT 
PNFTEEGIiVNFSKMRMISHI IREIRQFQQXSYRIDHQPKVAQYL 
LDKDLI IDEDTLYELSLKI EPRLPA 


6543 


1857 


950 


FVSGCGRAGIGl^WAMAABARVSRWYR3GLASCX3AAan^ 
LKVHLQTQQEVKLPJITCMAIiR^ 

YSLTRFA1YETVRIJRVAKGSC^LPFHEKVI^SVSGI>AGGFVG 
TPADI»WVRMQ1TOVKLPG^QRRNYAHAI^^ 
SGATMASSRGAliVTVGQLS CYBQAKQLVLSTCYLSDNIFTHFVA 
S FIAGG CATFIKXiPLDVLKTRLMNS KGEfQGVFHCAVKTAKLG P 
LAFYKGLVPAGIRIjIPHT^TFVFLEQLRKNFGIKVPS 


6544 


630 


79 


pspcfirsrijxsqpwmaglfanlsqnfsjlhqpqshvrvrras is 
epsdtdpeprtlnpspagwfvqqhpeiielmssfrerfgfjwlqy 
rshlepsgnplpatpttsapsappassqgpdtaprpsppqbear 
gpqespqkmseevraepqeeekekegkekkeegemaplpeaht^ , 
egkqkecp 


6545 


176 


560 


PPHSHAALLPAAMTPLLTLILWU^LPIiAQAIJ3CHVCAYNGDN 
CFNPMRCPAMVAYCMTTR'J^YTPTPJ4KVSKSCVPRCFETVYDGY 
SKHASTTSCCQYDI^3KmJIAT^^ 


6546 


1657 


364 


HIJ^I^EVAAFFWDI^rVRKHFCFLKCLPRVRPFYAVKCNS 
SPGVLKVIAQIXS^FSO^KAFifEliVQHIGIPASKIICANPCKQ 
IAQIKYAAKHGIQLLS FDNEMFJJUOATKSHPSAKMVLCIATDDS 
HS I >5CLS LK bWSLKSCRHLLElJAKKHHVEVVGVS FHIGSGCPD 
PQAVAQS I ADARLVFEMGTELGHKMHVlJ)tiGGGFPGTEGAKVRF 
EEIASVINSALCLYFPEGCGVDIFABI^RYYVTSAFTVAVSIIA 
KKEVLJjDQPGRBEENGSTS KTIVYHLDEGVYGI FNS VLFDNICP 



520 



WO 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Aianine, ^Cysteine, D=Aspartic Acid, E=> 
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S^Serine, T=Threonine f V-Valine, 
W=Tryptophan, Yt-Tyroeine, X-Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
\=*possible nucleotide insertion) 








TPILQKKPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDW 
LVFDNMG AYTVGMGS PFWGTQACH I TYAMS RVAWEALRRQLMAA 
EQBDDVEGVCKPLS CGWE I TDTLCVGPVFTPAS IM 


6547 


l 


541 


IMS KYIAPAI,CSQPGMMRCaUlRCXra?QPPHALRPLIiI^ 
PPIJVAAAAGPNRCDTIYCX3PAE<ILIRIX3DSMGRGGEIiETICI^W 
NDFUACASQVLSGCPEEAAAVWES LQQEARQAPRPNNLHTLCGA 
P VHVRERGTGS BTNQBTI^ATAPALPMAPAPPLIiAAAlAIAYE*L 
RP1A 


6548 


2 


219 


FVSRLS VRDVRFPTFLGGHGADAMHTDPD YSAAYVP I ETDABDG 
IKGCGITFTLGKGTEVGELKILSRFQNA 


6549 


73 


1490 


ETGRVCEDARPACGSRS RRRRKEAAP3I PTPS PSS SS PTSSRPA 

ARAFSKAPARIiSRPRAREEPPDPGRRYIQBEI IQARKHKI.IKMC 

SSVAAKLWFLTDRRIREDYPQKEILRALKAKCCEEELDFRAVVM 

DEVVI.TIEO^NICI»RIHGBLITAYPQVVVVRVPTPWVQSDSDIT 

VIiRHIjEKMGCRLMNRPQAIItfCXTO 

YGGH3NPAKMIDBABVLBFPMVVKOTRGHRG1^ 

DI^HLIRHEAPYIJTOKYVKESHGRDVRVIVVGGRVVGTMLRCST 

DGRM0SNCSLGGVGMMCSI^EOGKQIiAIQVSNIIX5MDVCGIDIiL 

MKDIX5SFCVCRANANVGFIAPDKACNLDVAGI lADYAASLLPSG 

RLTRRMSLLSVVSTASET5BPELGPPASTAVDNMSASSSSVDSD 

PESTBRELLTKLPGGLrNMNQLIANEIKLIiVD 


6550 


2293 


922 


FRVSRO^PDCGIEQMGLAMBHGGSYARAGGSSRGCWYYLRYP^ 
LFVSIiIQFLI II^VLFMVYGNVHVSTESNLQATERRAEGLYSQ 
LLGLTASQSNLTKKLNFTTRAKDAIMQMWLNARRDLDRINASFR 
QC^GDRVI YTNNQRYMAAI ILSEKQCRDQPKDMNKS CDAIiL FML 
NQK^HCPLEVEXAKEKTICTKDKBSVIiIjNKRVABE^ 
I^HQERQLAK^QLQKVOALCLPLDKDKFEMDLRNLWRDS 1 1 PRS 
LDNl^YNLYHPLGSBIiASIJlRAa)HMPSr^SKVEEIiARSLR^ 
IERVARENSDLQRQKLEAQQGLRAS QEAKQKVE K KAQAREAKIjQ 
AE CSRQTQLALEEKAVIJ? KERONIAKEl^EKKREAEQLRMELAI 
RNSAIJyrCIKTK^QPMMPVSRPMGPVPNPQPlDPASLBBFKRKI 
LESQRP PAGI PVAPSSG 


6551 


157 


74Q 


r QPPDPRNMTLAAYKE KMKBIiPLVS LFCSCFLADPIiNKS SYKYE 
ADTVDLNWCVISDMEVIBIJJKCT^ 

ASLPHRRDPSLEEIQKKLKAAEERRKYOHABIjLKHIiAEKRKHER 

eviqka r eennnf i kmakeklaqkmesnkrnremi laaml.er lq 
kkdkhaeevrknkelkekasr 


6552 


157 


748 


IQP PD PRNMTIiAAYKEKMKEL P LVS LFCS CFLADP LNKSS YKYE 
ADTVDUWCVISDMEVIELEnCCT 

ASLPPJo^DP5LEEIQKKIiKAABERRKYQEAEI.TiKHIiAEKEEK^ 
EVIQKAI EENNNFI KMAKEKIiAQKMESRKENR^AIUiAA>tLBRLQ 
EKDKHAEEVRKNKELKEEASR 


6553 

• 


2 


1807 


ITVWSKMAAHI^YGEVNLNVLRKAVR 

B YLTGPreLIAQYSLLKEHEVBKMFTLKGNRLPAAWl^ I FFV 

RPRXELMDI IAKNVLSKDRRGP TRDFHI L PVPRRS LLCEQRLKD 

LGVIX^PIHREBYSLDLIPFIXJDIiI^MESEGAF^ 

LYHAAKGIi>frLG>\LYGTIPQIFGKGECARQVA^ 

SONS I FPVFim^r£)RNVDIiLTPIiATQLTYEGLIDEI YGIQJMS 

YVKLPPEKBAPKJCQGDGGKDLPTEAKKLQIiNSAEKIiYAEIRDKN 

FNAVGSVLSKKAKI ISAAFEERHNAKTVGEIKQFVSQLPHMQAA 

RGSLANHTSI AELI KDVTTSEDPFDKLTVEQEFMSGI DTDKVNN 

YIEDCIAQKHSLIRVt^LVCLOSVt^SGIiKQKVLDYYKREILQT 

YGYEH I LTLH^EKAGLLKPQTGGRNNYPT IRETLRLWMDDVNB 

QNPTDI S YVYSGYAPLS VRLAQLLSR PGWRS I EEVLR I LPGPIfF 

BBRQPLPTGLQKKRQPGBITOVTLIFFLGGVTFAEIAALRFI^QL 

HX^EYVrATTKLMNGTSWIBAbMEKPF 
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ID 
NO: 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
ires i due of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
Sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F^Phenyl alanine, G=Glycine, 
H=Histidine, I«Isoleucine, K> Lysine, 
i>=Leucine, M=Methionine, N=Asparagine, 
P-Proline, Q=Glutamine, R«Arginine, 
S=Serine, T=Threonine, V=*Valine, 
W-Tryptophan, Y=Tyrosine, X=>Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 


6554 


119 


1244 


FBtWSQVSVESGALHWIVGGGFGGIAAASQLQAIOTPFMLVDM 
KDSFHHNVAALRASVETGFAKKTFISYSin'FKDNFRQGLVVGID 
LKNQMVLLQGGEALPFSHLIIA^ 

AYEDM VRQVQRSRF I WVGGGSAGVEMAAE 1 KTE YPEKEVTLIH 

SQVAIJUlKBLI,PSWQKVKEIIiRKGVQIJJ,SERVSNIJBELPLN 

BYI^IKVQTDKGTKVATNLVII^^IKINSSAYRKAFBSRIAS 

SGAUWNEHLQVBGHSNVYAIGDC^VR^ 

ANIVNSVKQRPLQAYKPGALTFIJ>SMGRNDGVGQISGFY^ 

VRLTKSRDLFVSTSWKTMRQSPP 


6555 


1552 


498 


IHMALU^KIMQVLLFLLIVTLCVILYKKVHKGTVPKNI^ 
TPEELEEEI PVVICAAAGRMGATMAAINSI YSNTDANILP^ 
LHNTLTRIRKWI BHSKLRE INFKI VE FNPMGLKGKIRPDS SRPE 
r>LQPLNFVRFYLPLLIHQHEKVI YLDDD VrVQGDIQELYDTTTiA 
IX3HAAAPSDIX^I,PSAQDINRIiVGI<^^ 

I S PSTCSFNPGVI VAIWTEWKHQR ITRQLEKWMQKNVEENLYSS 
SIjGGGVATS PMi»rVFHGKYSTINPI*WHI RHLGWNPDARYS EHFL 
QKAKIiLB^NGRHKPWDFPSVHNDLWESWFVPDPAGIFiaLNHH^ 


6556 


241 


1449 


ASLCJOK:FVVailVL^IILPSLQSPPTFGFTiIJ3IIX5VLVRGHRVl 
PAALKAFRRLVNS QGQLRVPVVFVWAGNILQHSKAQinjSAIiI^ 
CEVDADQVILSHSPMKLFSEYIffiKRMLVSG^ 
N\mVDELRMAFPXJ^Di^LERI«JOT 

BP VRWBTSLQI* IMDVLLSNGS PGAGIATPPYPHLPVLASNMDLI* 
WMAEAIOTORFGHGTFLLCLET IYQKVTGKKLR YEGLMG KPS ILT 
YQYAEDLI RRQAE RRGWAAP I R KL YAVGDNPMS DVYGANL FHQ Y 
IX3KATHDGAPBLGAGGTRQQQPSASQSCISILVCTGVYNPRNP0 
STBPVLGGGEPP FHGHRDLCFSPGLMEASHWNDVNEAVQLVFR 
KEGWALE 


6557 


2598 


1534 


RMCGRTSCHLPRDVLTRACAYQDRRGQQRLPEWRDPDKYCPSYN 
KSPQSNSPVLLSRIiHFEKDADSSBRIIAPMRWGLVPSWFXESDP 
SKLOPNTTNC^DTVMEiaiSFKWI^KGRR 
QGTNQRQPYFIYFPQIK-I^KSGSIGAADSPENWKKVWIMWRLLT 
MAGIFDCWBPPBGGDVLYSYTI ITVDSCKGLSDIHHRMPAILDG 
BBAVS KWLJDFGBVSTQEALKLI HPTBNI TFHAVSS WNNSRNNT 
PECXAPVDLVVKKEIjy^SGSSQRMIiQWIiATKSPKKEDSKTPQ^ 
ESDVPQWSSQFLQKS PLPTKRGTAGLLEQWLKREKBKBPVAKRP 
YSQ 


6558 


21 


1138 


FHGRRRGGRKWELGSCLKGGRKAABEBGEPKVKKRRIiLCVKFAS 
VAS CDAAVAQCFLAKNDWEMER I 
SBPKTYVDLTNEBTTDSTTSKISPSKDTQQEB^GSMFSLITWNID 
GLDLNNLSERARGVCS YLALYS PDVIFLQBVIPPYYS YLKKRS S 
NYE 1 1 TGHEEGYFTAIMLKKSRVKLKSQBI I PPPSTKMMRNLLC 
VHVNVSGNBIiCIiWrSHLESTRGlIAABRMH^ 
ATVTFAGDTNLRDRFA/TRCXSGIiPNNIVW 

TXJMNSNLGITAACKLRFDRIFFRAAAEEGHI I PRSLDLLGLEKL 
DCGRPPSDHWGIitiCWIiDIIL 


6559 


3 


364 


GPBLSGIiPTRPKKI*KANOTPIAMIX:CMRSCSVPTGPATTICSS 

DKSCRO^CLPSTCPHTWnj^PTCCntf^ 

IJ^SCQPTPGLETXiNLTTFTQPCCEPCLPRGC 


6560 


3 


1435 


TATSGGI WLRRKWRCHWPRPLPQS CVGTEGGLQVRDTSSRXAKG 
GVDHTKMSLHGASGGHERSRDRRRSSDRSRDSSHERTESQLTPC 
IRNVTSPTRQHHVEREKDHSSSRPSSPRPQKASPNGS I SSAGNS 
SRNSSQSS S DGS CKTAGEMVFVYENAKEGARNrRTS BRVTLIVD 
NTRFWDPS I FTAQPNTMI/3RMFGSGPJKHNFTRPNEKGEYEVAE 
GIGSTVFRAI LDYYKTGI IRCPDGISIPBLREACDYLCISFEYS 
TIKCRDLSALMHEI^NDGARRQFEFYLEEMILPLMVASAO^ 
ECHIWLTDDDWDWDBEYPPQMGBBYSQ1IY3TKLYRFFKYIE 
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SBQ 
ID 
NO: 


Pred i C t ed 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
se cpi ence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amine* ar*i rl fl^cffrw*rit" ronl-ainSno nioriAl Tn^rit-i 
(A-Alanine, C=Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P ^Proline, Q^Glut amine, R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /=possible nucleotide deletion. 








NRDVAKSVLKKRGLKKIRLGIEGYPrYKKKVKKRPGGRPKVIYN 
WQRPFIRMS^KEEGKSRHVDFQCVKSKSITNLAAAAADIPQD 
QLVVMHPTPQVDBLDILPIHPPSGNSDLDPDAQNPMIj 


6561 


3 


1086 


PGRRFRRKKSSSSRWFPADCLLGrj?GPASSLLSPEPSPSWPSHS 
PCPMAALTDLSFMYRWFKNCNLVGb^EKYVFITGCDSGPGNLL 
AIGQLVDRCTMQVLAACFTEBGSQK^^ 

I KAAAQWVRDKWEQGI.WALVNNAGVGIiPSGPNB^TKDDFVKV 
INVNIiVGLI 2VTI/HMLPMVKRARGRVVNMSSSGGRVAVIGGGYC 
VSKPGVEAP3DSI RREIiYYFGVKVCI IEPGN YRTAILGKENLES 
RMRia^ERLPQETPJ>SYGEDYPRiyTDKIiKNIMQYAKPRVRDVI 
NSMEHAIVSRS PR IRYNPGLDAKLLY I PIAKLPTPVTDFUiSRY 
IiPRPADSV 


6562 


1 


1562 


MSTtiYDIRAHKAQLIJlFPASSDSNKALBQRRTLHTPKLEHLDRV 
LYEWFI^KP^EGVFVSGPMLIBKAKDFYBQMQLTBPCVFSGGWIj 
WRFKARHGlKiOjDASSEKO^ADHQAAEQFCAFF^SIiAAEHGLSA 
EQVYKADBTGLFWRCI^NPTPEGGAVPGPKQGKD 
TGSHPJUKPIAIGKCSGPRJ^^IQHUF^YKAQGNAWVDKBI FS 
DWFHHI PVPS VREHFRTIGLPEDSKAVItLLDSSRAHPQEAELVS 
SNVFTIP^PASV^LVQPMEG^lRilDFMP^INPPVpLQGPHAR 
YNMNDAI FSYACAWNAVPSHVFRRAWRXLWPSVAFAEGSSSEEE 
I^AEC^VKPHNKSFAHII^VKBGSfiCPGQLP^RQAASWGVAG 
REAEGGRPPAATS PABWWSSE KTP KADQDGRGDPGEGEEVAVfE 
OAAVAFnAVTjRFAERQPCFSAQETGQLRALRAVFRSQQQVRRRR 
GALGAVVKVEALQEGPGGCX5ATAQSPLPCSSTAGDN 


6S63 


1319 


2694 


LARPAQPVLl^PEGAGPPVPAGHIiVHHIWGHljREJRAHPDJaEA 
HEHPLPCDQM FNRG#GGHLRMVE AN5RGVVWG IG YDHTAWVYTG 
GYGGGCFQGLASSTSNIYTO^DVKCVHIYENQRWNPVTGYTSRG 
LPTDRYMWSnASGLQECTKAGTKPPSI^AWSDWPVDPSVPGG 
TDQEGWQ YASD FPAS YHG S KTMKD FVRKRC W ARKCKLVTSGPWI. 
BVPPIALRDVS 1 1 PES PGAEGSGHS IALWAVSDKGDVLCRLGVS 
KLtN Pi\Ofc»£iW I*H Vi^TUQP r AoX J> I (j AC x ^ V nAVAKlXiSAFYRGSV 
YPSQPAGD CVfYH I PS P PRQRLKQVSAGQTSVYALDENGNLWYRQ 
GITPSYPC^SSWHHVSNNVCKVSVGPL^ 

RCTVCmiGTOPHEPKGHGWDYGIGGGWDHISVRANATRAPRSS 


6564 


1 


975 


APGSCAEtf^CGRCWSRAmGCQIJXJIJ^^ 
KRAABTHtGFETVSEKEKGGKVYQVFESVAKKYDVMNDMMSLGI 
imVfflOH^WKMHPLPGTQIiLDVAGGTG^ 
KQKRQLRAQQNLSWEEIAIOSYQNEEDSIjGGSRVV^^ 
VGKQKAIAQG YRAGLAWVIiGPAEEItP FDDPKFDI YT1AFG I RNV 
THI DGALQRAHRVLKPGGRFIjCLB fsqvnnpl I srlydlys FQ V 
!PVIGE\nAGDWKSYQYLVBSIRRPPSQEEFKDMIEDAGFHKOT 
YESLTSGIVAXHSGFKL 








6565 


1464 


999 


RSAVANGLTKRRMGLKLNGR Y I SL I LAVQ 3AYLVQA VRAAGKCD 
AVFKGFSIXXLKIjGDS1«NYP<JGIiDD 
VTALTDCX)EGAKDMVflPKIiRKES KN^ 
LLPAFPVLLVS I*S AALAXWIjS F 


6566 


3 


1385 


KYESAQPGGTQPEPGLG^MAIHKALVMCIGLPLFLFPGAWAQG 
HVP PG CS (^LNPbYYNLCDRSGAWG I VLEAVAGAGI VTTFVLT I 
ILVASLPFVQDTKKJ?SLLGTQVFFLI^TU?LFCLVFACVEKPDF 
STCASRRFLFGVLFAICFSCIJ^IVFALNFLARKNHGPRGWIF 
TVAXXLTLVEVI INTKWLIITLVRGSGEGGPQGNSSAGWAVAS P 

cAiAwi^FVMALiYVMiifcii^^ 

llttatsvaiwvvwivmytygnkqhnsptwddptiju:aiaanaw 
apvlpyvipevsqvtksspeqsyogcmyptrgvgyetilkeqwg 
g^mfvenkafsmdepvaaio^vspysgyngqlltswqptemal 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(At-Alanine, OCysteine, D^Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine , 6=Glycine, 
H^Histidine, I=*Isoleucine, 10= Lysine, 
L=Leucine, M=Methionine, N=*Asparagine , 
P=Proline, Q=Glutamine , R^Arginine, 
S=Serine, r=lhreonine , V=Valine, 
W=Tryptophan, Y^Tyrosine, X^Unknown, *-Stop 
Codon, /spossible nucleotide deletion, 
\apossible nucleotide insertion) 








MHKVPSBGAYDX ILPRATANSQVMGSANSTLRAEDM YSAQSHQA 
ATPPKDGKNSQVFRNPYVWD 


6567 


125 


863 


TKRSNLKAYACS IHHIRTMS YVFVNDSSQTWVPLLQACI DGDFN 
YSKRIJJiKGFDPNIRDS RGRTGXJTLAAARC^^ 

r.T.7V*T r nV7VM r PH T UT .^V^lTl7TYPTOT7T TTRKIfir VTT>TnUU/V XTOT \/T. 
LiJjA 1 V jl -LALiril iUjHVIjA A \i c L» V LsLi RJ.U1 lJWtt\JC»A.i if Jb VXi 

AKRRGVNKDV I RLLE S LB EQ EVKG FNRGTI ! SKI. ETMQT AE SRS1A 

MB SHS LLNPNIjQQGEGVLSS frttwqe fvedlgpwrvlll I fvi 
AIiLSLGIAYYVSGVLPFVENQPBLVH 


6568 


3 


1183 


HASDRLLVLPDWYSHFSQASANLQGPSRTTELFHPTLAS ISSPM 

LEGAKLYTOVDHCm J EGLVRGCKASIJ J TXX3DYINLVQCETLKDL 

KIHLQTTDYGNPLANHTKPI.TVSKIEKTEMRKRLCGBFE 

I^PI^TFLTYMTC^YMIDNVIIJ^GALQKKSVKEII^KCHPr^ 

RFTEMFAVNIABTPSDLFNAJiaBTPI^FFQD^ 

IELLRNKLYKS YtiSAFYKFCKNHGDVTABVM CP ILEFEADRRAF 

IITLNSFGTBLSKEDRBTIiYPTPGKIjYPFJG^ 

NVADHYGVYKPIiFK AVGGS GGKTIiEDVFYEREVQM>JVIiAPNRQF 

HYGVFYATVKLKEQEIRWIVWIAECISQRHHTKINSYIPIL 


6569 


205 


1532 


RRRGPQRLGHGRPTPIiLCRP^TAGPSHWEKQARAKOGJLRPVDPR 
RMSWL FPLTKS AS S S AAGS PGGL.TS LQQQKQRL I ES LRNS II SSI 
AE1QKDVEYRLPFTINNLTIMINJLLPPQFPQEKPV1SVYPPIR 
HHIMKDGVYVTSPLVNNFTHHSDI^ 

PTSTAFP YLYSNPSGMS P YASQGFP FLPPYPPQKAMRS I TS LS V 

ADTVSSSTTSHTTAKPA7^SFGVLSmPLPIPTVI)ASIPTSQNG 

FGYKMPDVPDAFPELSEJaSVSQLTDMNEQEEVTiLEQFLTLPQLK 

QIITDKDDLVKSIEELARKmil^PSI.RAK^^ 

KS^FEKXMQRQHELSESCSASAIjQARIJCVARHEAEKESDNIAED 

FLEGKMEIDDFLSSF^iEJCRTI CHCRRAKESKLQQAIA^SQFHA 

ir Li 


6570 




1304 


ARLPRLTFLRBGFLYVLLSHWVFVGAPRPPASDSWKKGIjVPSAP 
PASRKMGSMU^API PLHPSLQIiTNYSF 
IiYGLSAVQTMHMNHWTIjG^ 

PFPALPFTTHLFln^KQGAXAH^PAIiHKDRPRFTJFAJrtAVAATQ 
EDPP KMGDLSKLS PGLGS P I SGGSKLTPDRKPS RGRLPSKTKKE 
F I CKPC^HFTKS YNLLIHERTHTDBRPYTCDI CHKAFRRQDHL 
RDHRYIHS KE3G?FKCQBCXJKGFCG^^^ 
SSAAKCSGBTVICGGT 


6571 - 


169 


656 


APDMNRKKUJKLTDTIiTKNCKH^ 

SkFLRGSLEBKMKYCFBVFDLNGDGFI SKEBWFHMLXNSLLKQP 
SEEDPDEG IKDLVEITI^KMDKX)HIX3KIjSFAIJ5rRTAVREETLIiL 
BAFGPCI,PDPKSQMEFEAQVFKDPNEFNDM 


6572 




l£4£ 


TPERAQPGALLGAAGCCVCGGRWWPRSHEPX5YFSSAKMGSKRRN 

LS GS ERHQ K LVD ENTCKKLIIVQALKNVNSQ I RNQMVQNENDNR V 

QRKQFLRIJ^^^FELDMBEAIQKAEEKKEIJKKI^I^QEEKLAM 

EIiAJKI:KHESIiKDEKI^QQVRENS I BLRELE KKLKAAYMNKERAA 

Q IAEKDAI KYEQM KKHAEIAKTMME EHKRI I KBENAAEDKRNKA 

XAQYYIiDLEKQIiEEQEKKKQBAYEQliLKBKIMIDBIVRKXYBED 

QLEKQQKLEK^lNA^IRRYIEEFQKEQAllWRKXKRBEMEEEWRKII 

KFANMQQQREEDRMAKVQENEEKPJ^LQNALTQKI^EMIJIQPJKD 

LEQVRQELYQEEQAE lYXSKIiKEEABKiCLRXQKEMKQDFEEQMA 

L KBLVLQAAKEBEENFRKTMIiAKFAEDDR I ELMNAQ KQRMK.QLB 

HRRAVEKIiIEERRQQFLADKQRELEEWQLQORROGPlNAIIEEE 

RUOiLKEKJm^LGYLPKGVFiaCEDDIDIu^ 

CEEK 


6573 


767 


275 


GGGGGESQS FPAOJDGTRTPAT^Cl^YX^PRKI^TQGGYDMVQK 
LFX)DFFRRRLSQRPTAEBLEQRNII»KPRNBQEEQBEKPJEIKRRL 
TRKLSQRPTVEELRERKI LXR FSDYVEVADAQDYDRRADKPWTR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

ami no a/ii 

sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C~ Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, Phenylalanine, G=Glycine, 
H^Histidine, I^Isoleucine, K=Lysine, 
It=Leucine, M=Methionine, N=Aspaxagine, 
P^Proline, Q=Glutaraine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryp t ophan , Y&Tyrosine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LTAADKVS RGECWRVGGRTVCWVSIiGSPLGS V 


6574 


204 


1159 


LESSVPVSVGVFWACGVSWTGAAGLUDGALSDTMARNAKKAMTA 
LARFRQAQLEEGKVKBRRPFLASECTBLPKAEKWRRQIIGEISK 
KVAQIQ NAGLGE PRI RDLNDE INXLLRBKGHWEVRI KELGGPDY 
GKVGPKMIjDHRGKEVPGNRGYKYFGAAKDLPGVRELPEKEPLPP 
PRXTRABI^KAJDFRYYGYIJJEDI^r^I^QBYBKKLRAEIiVE 
KWKABREARLARGEKEEEBREEEE IN I YAVTEEESDB EGSQEKG 
GDDSQQKFIAHVPVPSQOEIBEAIiVRRKKMELLQKYASKTLQAQ 
SEEARRLLGY 


6575 


117 


820 


SPALASQSGGITBBKMLfiPQEMGVIDLPDYEHVKDBTFPPFPpp 

ASPERQDGEGTEPDEESGNGAPVPVPPKRTVKRNIPKI.DAQRLI 

SERGLPALRHVFDKAKPKGKGHEAEDIiKMLIRHMEHWAHRJjFPK 

WFBDPIDRVEYLGSKKEVQTCLKRIRLDLPILHBDFVSl^ 

AENNEHDVTSTELDP FI/TNLSESEMFASELS I SLTEEQQQRIER 

NKQLALERRQAKLP 


6576 


1 


1060 


P E PQALVGQKRGALRLL VARIiVLl*VSAPAEVRRfeVLR PVLS WMD 
RF.TRALADS HFRGLGVDVPGVGQAPGRVAFVS EPGAFS YADFVR 
GFLLPNLPCWSSAF^TO^WGSRRRWVTPAGRPDFDHLLRTYGDV 
VVPVANCX^QEYNSNPlOilHMTLRDYI^ 
YLKDWfflLC^DFPVBDVFTLPWFSSDWJ^FWDALDVDD 
AGPAGSWSPFHADIPR5 FSWSVNVCGRKKHIiL FP PGOEEAIiRDR 
HGNLPYDVTSPALCDTHLIIPRNQLAGPPLE ITQEAGEMVFVPSG 
WKJIQVKiaLVMCCFSCFLSGAFLQEIXSSTTSPLSQ PELGWNGVAH 
G 


6577 


2271 


987 


SDRMASDDFDIVIEANLEAPYKKEKDEQQRKEVKKDYPSNrTSS 
TSNSGNETSGSSXIGBTSNRSRDRDRYRRRNSRSRSPGRQCRHR 
SRSWDRRHGSESRSRBHRREDRVHYRSPPLATGYRYGH5KSPHF 
RBK5PVREPVDNLSPEERDARTVFCM0LAARIRPFJDLBDFFSAV 
GKVRDVRIlSDRNSRRSKGIAYVEFCEIQSVPIiAIGLTGQRLLG 
VPI I VQASQABKNRIiAAMANIMKGNGGP«RIj YVGSWIFNI TED 
MLRGIFBPFGKIDNTVLMKDSITIGRSI^ 

BQIiNGFRI^RPMRVGHVTERLDGGTDITFPIXiDQF^IJ3SAGG 
RFQl^AIO^AEGAGIQLPSTAAAAAAAAAAQAAAI^tJJGAVPI^ 
I^PAALTALSPALNEJ^QCIiQriSSLFTPQTM 


6578 


377 


1489 


PSSSATMNRAPIiKRATlLHMALlGASDPSAEAEANGEKPFIJ^ 
t^IAI»WSIV5roVTSISMVFI^KYI^S 
VTTMjCRGIiSAIiAACCPGAVDFP3LRI*Dh^^ 
MITFNNI/njKYVGVAFYNVGR5LTT V FNVLLS YLIiLKQTTS FYA 
LLTCGI I IGGFWI^VDQEGAEGTI^WI^TVFGVLASLCVSLNAI 
YTTKVLPAVDGS IWRLTFYNNVNACII^FLPLIJLLI^Bl^ALRDF 
AQLGS^PWGMfmjGGLFGFAIGYVTGLQIK^ 
KACAQTYLAVLYYBETKS FLWITOSNhfMVIjrGGSSA YT^f\H^WEMK 
KTPEEPSPKDSEKSAMGV 


6579 


2 


711 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVFYFTSSSVNSSAY~ 

IYMGKDKYt^EDLXKHG>JPEDIWFHVUKI>SSJ\lIVYU^IKGENl 

EDIPKEVTJ4DCAHLVKANS IQGCKMNNVNWYTPWSNLKKTADM 

DVGQIGFHRQKDViCIVTVEKKVNEILNRLEK^^ 

KCRDREERHEKKAQIQEMKKJIEKEEMKKKREMDEIJISYSSLMKV 

ENMSSNQDGNDSDBFM 


"~~65B0 


62 


1571 


LVALKNWKPKGTNI PAPQSPVFGEAVSGVYMMTKVLGMAPVLGP 
RPPQEQVGPLMVKVEEKEE KGKYLPSLEMFRQRFRQPGYHDTPG 
PREALS^I,RVLCCKWLRPKIHTKEQILELLVLBQFLTII,PQKLQ 
AWVQEHCPESAEEAVTIXPJDLERETJ^EPGHQVSTPPNEQKPVWE 
KISSSGTAKES PSS MQP QPLETSHKYES WGPL YIQESGEEQE FA 
QDPRKVRDOUiSTQHBBSADEQKGSBAEGLKGDl ISVI I ANKPE 
ASI^QCVNLENEKGTKPPI^EAGSKKGP^ VPTKPTPGERRY I 
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' SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue, of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D^Aspartic Acid, E- 
Glutamic Acid, ^Phenylalanine , G=Glycine, 
H~Histidine, I=*Isoleucine, K=I»yeine, 
^Leucine, M^Methionine, NiAsparagine, 
P=Proline, Q=Glutamine, R=Arginine , 
S=Serine, T-Threonine, V=>Valine, 
W=Tryptophan, Y-Tyrosine, X^Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








CAEOTKAFSNSSNLTKHRRTirTGEKPYVCTKOGKAFSHSSNLTIi 
HYRTHLVDRPYDCKCGKAFGQSSDLLKHQRMHTBEAPYQCKDCG 
KAFSGKGSLIRHYRIHTGEKPYQCNECGKSFSC^AGLSSHORIJI 
TtlKKPYKCKECGKAPNHSSNFNKHIIRIHTGEKPYWCiniCGKTFC 
SKSNLSKHQRVHTGEGBAP 


6581 


228 


476 


RVFIiKDItSSTPMASNNTAS IAQARKLVEQLKMEAN IDRI KVSKA 
AADI»MAYCEAHAKEDPbbTPVPAS ENPFREKKFFCAIIi 


6582 


1428 


718 


CPTTKTHCSPVSVPYLSPI.VLRKEDBSLLENEGDQVIHTSSPIN 
QHPI 1 FWTliVW YFRRLDLP5NLPGLILT5 EHCNEGVQ1.PLSSLS 
QDSJCLVYIQLLWDNINLHQEPREPLYVSWRNFNSEKKSSLLSEB 
QQETSTLVETIRQS IQHNNVLKPINLLSQQMKPGMKRORSLYRE 
ILPI^LVSLGRENIDIRAFTDNEYGIAYJNSLSSElLBPJjQKIDAP 
PSASVEWCRXCFGAPLI 


6583 


487 


41 


RI FSKrraGRLRWRCTWRPATALWSASIiRLGTSSMHPS PRSISLP 
LSMMLSPI*PSNTRGI>S PTALFRS PDSEH ATSCPRLHLWRCRAPL 
RSPS PLGPJjQVTiPRS P1JIVHTHWSGKKVLGLQVQRSRSGTGPAC 
SQAGSGAVQGGNWCIF j 


6584 


189 


1750 


PLPMAAIX?PSSQ^3YVVRVPI0^KKYNIMAF1IAADKVNFAT 

WNQARLERDI^NKKIYQKKFWRSGAGSBFNRKLRRRARRKnKYG 

I VLKEFRPEDQPWIiLR VNGKSGRKFKG 1 KKGG VTENTS YY IFTQ 

CPDGAFEAFPVHNWYNFTPIiARHRTIjTA^ 

FS IMQQRRLKDQDQOHPEEEKEKRGRRKAS EJbR IHDLHDDLEMS 

SDASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDS 

ddgdfegqevdymsdgssssqeepeskakapqqeegpkgvdeqs 
dsskbseeekppbedkeeeeekkaptpqekkrrkdsseesdssk 
esdidseassaffmakkktppkrerkpsggssrgnsrpgtpsae 
ggstsstlraaaskleogkrvsbmpaakrlrldtgposlsgkst 
pq p psgkttpns gdvqvtedawryiitrkpmttkdliiikkfqtkk 

TGLSSEQ^WNVLAQILKRIJ^ERKrtrNDKMHFSLKE 


6585 


3 


1678 


GPIRWSRIDPFVGGDPRABASCSVZiHSKPHAMADSRDPASDQMQ 

HWKBQRAAQKADVI.TTGAGNPVGDKLNVITVGPRGPIXVQDVVF 

TDEMAHFDRBRI PER WHAKGAGAFG Y FBVTHDI TKYS KAKVFB 

HIGKXTPIAVRFSTVAGESGSTADTVRDPRGFAV^ 

VGNNTPIFPIRDPTI.FPSPIHSOKRNPCn'HIiKDPDMVWDFWSLR 

PESLHQVSFLF3DRGlPIX3HRHMNGYG3inFKLVNANGBAVYCK 

FHYKTIXKSIKNLSVElJAARIiSQBDPDTC 

TFYIQVMTFKQAETFYFNPFDIiTKW 

PVNYFABVEQXAFDPSNMPPGIK^ PDKMLQGRLFAYPDTHRHR 
LGPN YIJIIPWCPYRARVANYQPJXJPM^DNQC3GAPNYYPNS P 
GAPEGQPSALBHS IQYSGBVRRPOTANDI»JVTQVRAFYVNVIiNB 
BQRKRLCENIAGHLKDAQI FIQKKAVKNFTEVHPDYGSHIQAI,!, 
DKYWAEKPKNAIHTFVQSGSHLAARBKANIi 


6586 


32 


804 


PLPEQPA2STSTMPVSGTPAPNKKRKSSKLIMBLTGGGQKSSGI, 

\!TJ~ZKK~T QVDDnVMT.RRT L <tT .T/PIOIXZG VMmTT.DOMD WVTTTW Him 
IHJLftjrVJU. j VIrIU/VriuBBlK>l u ilHIftwflflT IViAtvwHKViSl^r X XlSBIMy 

DVFSDS SMDHFQ KFL PTVGGQLGTAGQGFS YS KSNGRGGSQAGG 
SGSAGQYGSDOXlHHIiGSGSGAGGTGGPAGQAGRGGAAGTAGVGE 
TGS^UQAGGEGKH ITVFKTYI SPVfERAMGVDPQQKMB^ 
YGAKABLPKYKSFNRTAMPYGGYBKASKRMTFQMPKV 


6587 


75 


1117 


RRVPSLGKMPECWDGEHDIOTPYGIJ^HVVIRGSPKGNRPAILTY 

TOVGLNHKLCFNTPFNFEDMQEITKHFWCHVDAPG 

POGYQFPSME^IiAAMLPSWQHFGFKYVIGIGVGAGAYVLAKFA 

LIFPDLVEGLVLWrDPNGKGWIDWAATKLSGLTSTLPDTVLSH 

LFSQBBLVNlTrFXVQSYRQQIGNVVNOJ^HLQLFWNMYN 

INRPGTTONAKTLRCPVMLWGDNAPAE^^ 

PLKMADSGGLPQVTQPGKbTEAF^ 

RTASLTSASSVIX?SRpQACTH3B3SF^liGQVNHTMBVSC 
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SEQ 
ID 
NO*. 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


(A-Alaninc, C=>Cyeteine, D*Aspartic Acid, E= 
Glutamic Acid, P^Phenyl alanine , G^Glycine, 
H**Histidine, I-Isoleucine, K=Ly3ine, 
^Leucine, M=Methionine, N=Asparagine , 
P= Proline, Q=Glutamine, R«Arginine, 
S=Serine, T=Threonine, V=Valine, 

Tryptophan, ^Tyrosine, X=0nknown, *«Stop 
COdon, /t=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6586 


137 


501 


LGLQAQI^EI*RTNNYQL3DELRKNGVELTSLRQKVAYLDKEFSK 

ttnVQr.lWQVTfhnPlfCUT t c;T?MI?MT AT\ trT t»o ApETMTii t rtvrert*T win 
nU imi Xw IvIVrtyK V t» V JjLo Ha &nJjUAlLLixu»QBJS£/ r RXiQNSXli MA 

EFSKLCSQWEOLEQENQQljKBGAAGAGyAQAGP 


6589 


2 


1405 


RPWGSLAI^TFSROfiFFQQIiIjQGCI>LP'rAQQGLDQIW 

CRI^WRLGLPSYIjKHASTVAGGPPSLYHPFQLHMVWVV^ 

YLVLFXCRHSSHRCVFIjSVTILIYIjLMGB 

WIVAMKAVSLGrT>IJ)RGBVGTVP3PVBnWYLYPVGTIVFGPW 
IS PHSYI^VQGRPLSCRWI^KVARjSIdu^AI^ 
FPYEI PLNGDRLLRIJKKRKMGTT^ 
GFLSEATATLAGAGF1T2KKDR1J2 WDIiTVS 
TSWWLPMS YWLNNYVFIQ^RUiTPSAVLVTYAASALl^ PH 
IJUlVl^IJ^ITxVBHVI.RKRIJU«rtiSACVM 
UJI/3VRALNLLFGALAI FHLAYLGSL FDVDVDDTTEEQGYGMA Y 
TVHKWSELS WASHWVTPGCWI FYRLIG 


659b 


2177 


656 


VRAYHHVI^LLEOTFTPMraiRDEx^QLIiR 
GSI^IJDDFRmXJKRGESFGISRIGSKIKGVFKSTTMEGAMLPNY 
GVAEGKDDFIEJ^IVVMEDDSPVEAVSTPNTPRNLAAWKISIPY 
VDFFEDPSSERKEKKERI PVPCrDVERNDRKAVGHEPEHWSVYR 
R YIiEFYVLBSKLTKFHGAPPDAQLPSKRI igpknyeflks kree 
FQEYLQKLLQHPELSNSQIiLADPLS PNGGETQFLDKILPDVNLG 
KI IKSVPGKLMKEKGOHLEPFIMNFINSCESPKPKPSRPELTIL 
S PTSENNKKLFNDLFKNNANRABNTlmKQNQMYFMEV^r^VEGVY 
DYI2MYVGRWFQVPD W LHHLLMGTR I TiFHtTTLEMYTD YYIjQCKL 
EQLFQEHRIiVSLITIiLRnAIFCEOTEPRSl^DKQKGAKOTFEEM 
MNYIPDI^VKCIGEETKrreSIRLLFDGIjQQPVIjNKQLTYVIiLDI 
VI QEI>FPEIiNKVQKEVTS VTS WW 


6591 


2177 


656 


VRAYEHVLSIJ^ENVFTPMFCHRDEYFRQLLRGAESPTRNSKLNR 
GSI^LDDFWTQKRGBSFGISRIGSKIKGVFKSrTMEGAMIiPNY 
GVAEGEDDFIEEGIVVMEDDSPVEAVSTPNTPRNrjAAHKISIPY 
VDFFBDPSSBRKEKKERIPVFCIDVERNDRRAVGHEPBHWSVYR 
RYLEFYVI^KLTEFlIGAFPDAQIiPSKRI IGPKNYEFLKS KREE 
FQFOTLQKIiLQHPBLSNSQI»IjADFLS?^ 

KIIKSVPGFQjMKEKGQHLEPFIMWFINSCBSPKPKPSRPEIiTIL 

SPTSENNKKLFNDLFKNIIANRABNTERKQNQNYFMBV^ 

DYIMYVGRVVFQVPDWIflmjJi4Gra 

FX)LFQEHRLVSLITLI»RDAIFCENTEPR^I^DKQKGAKQTFEBM 
MNY I PDLLVKC IGE BTKYES IRLL FDGLQQP VLNKQLTYVLLDI 
VIQKLFPELNKVQKBVTSVTSWM 


6592 


3 


1861 


APEFLGSTI S SGSM IDANLKLLQBAEQRIjKAXVAEKyAXATKEG 
DLPQVBRFFK I FPLLGLHBEGIiRKFSE YLCKQVAS XAEENLLMV 

T/imM^nWPZk&VT 1?JVlYPT/rT.T.17l?f2 T 2k O TUEfl'UIYD T twiwrnarp 
lajx unouKiuut v x r/uiiiii xjlu: jsx? ulki vnxa\i "«* * I uirv»K 

LYTLIKYLQVECWQVEKVVDKFIK^ 

TTEKI E PRELDP I LTEVTIJ^NARSELYLRFLKKR I SS DFEVGD3 

MASEEVKQEHQKCLDKIJ^a^CTMQEMGLYVTMEK 

TVNKAVALDTYE KGQLTS SM VDCVFYIVKKCIGRALS SS S IDCL 

OVMINIiATTELESDFRJDVLCNKLR^FPATTFQ 

IMHSSL(XXSKFDTKGIESTDEAKMSFI»^^ 

KTI^DCTKIiFSQGIGGEG^OJUCFD^ 

GLTELKT STAIKPQVQPWINS FFSVSHNI EEEEFNDYEANDPWVQ 
QFILNLEQQMAE FKASLS PVI YDSLTGLMTSLVAVELEKVVLKS 
TFNRI^GI^FDKEIiRSLIAYLTTVTTWTIRDKFARLSG^TILN 
IiERVTEILDYWGPNSGPLTWRLTPAE^QVLALRIDFRSEDIKR 
LRL 


6593 


3 


1837 - 


EAFSAGSRRKGIiAI^RGVLGGI^YCPCCC^ 
RGGEGGGGRGRGDKRRRRQARRQRRRPE PAEARGGKMADVLS VL 
RQYNIQKKEI WKGDEVI FGEFSWPKNVKTNYVVWGTGKEGQPR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=sAlanine, C-Cysteine, D^Aepartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G=Glycine, 
H=Histidine, I»Isoleucine, K=&ysine # 
LsLeucine, M»Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R-Arginine , 
S=Serine, T=Threonine, V^valine, 
W-Tryptophan, Y=Tyrosine, X=Un known, *=*stop 
Codon, /"possible nucleotide deletion, 
V=possible nucleotide insertion) 








EYYTLDSILFLLNNVHIiSHPVYVRRAATENIPVVRRPDRKDLLG 

YLNGE AST8 AS I DRS APLE IGLQRSTQVKRAADBVLAKAKKPR I 

KDEBCVRLDKBRIiAARLEGHKEGIVQTEQIRSIiSKAMSVEKIAA 

I KAK IMAKKRSTI KTDLDDD I TALKQRS FVDAEVDVTRDIVS RE 

RVWRTRTTILQSTGKNraKNIFAlLQSVKARBEGRAPBQRPAPN 

AAPVDPTIrflTK^P IPAAYNRYDQERFKGKF^TO 

GOTLKS VTJSGASARKTQTPAAQPVPRP VSQARP P PNQKKGSRTP 

III IPAATTSLlTMLNAKDIiIiQDLKFVPSDEKKKQGCQRENETL 

IQRRKDQMQPGGTAI SVTVP YRVVDQPLKLMPQDWDRVVAVFVQ 

GPAWQFKGWPWLLPDGS PVDI FAKIKAFHLKYDEVRIiDPNVQKW 

DVTVLKI^SYHKRHIiDRPVFLRVWETI^RYMVKMKSHIiRF 


6594 


1 


1096 


KFPGRRFRGSQASPLCAT<!tePALLRAPTRAAMTRSLFKGNFWSA 
DILSTIGYDKIIQIILNNGRKNCKEFEDFLKERAAIKERYOKDIiL 

ARKMEEFRKXQKLQRKKTET;IMDAIHKQKSLQFKKTMDAXKNYK 
QKCRDKDKAEQAVSRSAKLVNPKQXJEKLFVKLATSKTAVEDSDK 
AYMLHIGTLDKVREEWQSEHI KACKAFBAQECBRIHFFRNALWIi 
HVNQCiSQQCVTSDKM YEQVRKSLEMCS IQRD I EYFVNQRKTGQ I 
PPAP IMYENFYSSQKNAVPAGKA'PGPNLARRGPLPI PKS S PDDP 
NYSLVDDYSIJ.YQ 


6595 


57 


781 


PI^TMSDSDLGEDEGIJ>SLAGKRKRRGNLPKESVKILRDWIjY1H 
nvmavp^roetctj^t .rfjfytnt .kvt.o t hnwf t mar rrt it.pdmlrkd 
GKDPtfQFTI srrogkasdvalprgsspsvlavs vpaptnvlsls 

VCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLLTRAEA 
GS PTGGLFNTPPPTPPEQDKKDFSSPQLIiVEVAI^RAAEMKLQK 
QQDPSLPLLHTPIPLVSENPQ 


6596 


2 


1026 


PRLPVRRYHGRRR1WRSRGHMAEGDAGSDQRQNEEIEAMAAIY 

GBEWC^IDDCAKIFCIRISDDIDDPKWTlXILQVMIiPNEYPGTAP 

PIYOLNAPHLKGQBRADLSNSLEEIYIQNIGESILYLWVEKIRD 

VL IQKSQMTKPGPDVKKKTBEEDVBCRDDL I IiACQPES SVKALD 

FPISBTRTEVEVEELPPIDHGIPITDRi^TFQAHIAPVVCPKQV 

KPWIjSKIiYENKKIASATBWIYAYRIYCBDKQTPI»QDCBDIX5CT 

AGGRIaLHIJ^ILNVKNVMVVVSRWYGGILLG 

ILVEKNYTNSPEESSKAIGKNKKVRKDKKRNEH 


6597 


2 


1026 


GFJWCVIDDCAKIPCIRISDDIDDPKWTI^^ 
PIYQLNAPWLKGQERADLSNSLBEIYIQNIGESILYLWVBKIRD 
VL I QKSQMTEPGPDVKKKTE EEDVECEDDLriiACQPESS VKALD 
FDISETOTEVEVEELPPIDHGIPITDRRSITOAHLAPVVCPKQV 
KTWl^KLYENKKIASATHNIYAYRIYC^DKQTFLQDCEDDGETA 
AGGRLLHIJ4E IIJJrVKNVMVVVSRW YGGI LLGPDRFKHINNCARN 
ILVEKNYTNSPKESS KALGKNKKVRKDKKRNKH 


6598 


1099 


419 


PRVRHATTMAMSFEWPWQYRFPPF^TLQPNVDTRQKQIiAAWCSL 

VLSFCTRLHKQSSMTVMEAQBSPIjFNNVKIiQRK^ 

LRKKGNliEWIi3KSKSSFLIMHRRPEEWGKI,IYQWVSRSGQNNSV 

FTLYBLTNGEDTEDEEFHGLDEATIjIjRALQAIiCW i itvs 

IXJPRROVLI^TOiPIJ^TSHI^RAFKRRQTCXrPPKTGSVTPPD 

SKGLQS 


6599 


164 


1593 


KMAALTTLFKYIDENQDRYIKKLAKWVAIQSVSAWPEKRGEIRR 
MMEVAAADVKQIX5GSVELVDIGKQKIiPDGSEIPLPPILLGRLGS 
DPQKKTVCI YGKO^V^PAALEDGWDSEPFTLVERDGKLHGRGST 
DDKGPVAGWINAI^YQKTGQEIPVNVRFCLEGMEESGSEGLDE 
LIFARKDTFFKim)YVCISDNYMjGKKKPCIl'YGLRGICYFFIE 
VKCSNKDIJISGVYGGSVHEAhrrDLIIJ^GSLVDKRGIilLIPGlN 
E AVAAVTK EEHKL YDD I DFD IEEFAKDVGAQ r LLHS HKKD ILMH 
RMRYPSI»SI»HGIBGAFSGSGAKWIPRKVVGKFSIRl,VPNwrPE 



52$ 



WO 01/53312 



PCT/USOO/34263 



SBQ 
ID 
HO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

amino ar»"id 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Ami nn apiH ao/*mc*Ti ^ con fai Ti*i net <s i rma 1 nonf *i /4o 

(A-Alanine, CwCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F- Phenylalanine, G=Glycine, 
H=Hiatidine, I=Isoleucine, K=Lysine, 
I>-= Leucine, M«Methionine , N=Asparagine , 
P=*Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W= Tryptophan, Y=Tyrosine, X-Unknown, *=Stop 

\~possible nucleotide insertion) 








WGEQVTSYLTKKFABLRSPNBFKVYMGHGGKPWVSDFSHPHYIj 
AGRRAMKTVFQVEPDLTREGGSIPVTLTFQKATGKNVMLLPVGS 
ADDGAHSQNEKLNRYMTIEGTKMLAAYLYKVSQLKD 


6600 


2 


934 


PGRLKRVAAMESAGLEQLIJ^ELLLPDTERIRRATKQr,QTVLRAP 
AALSALCDLLAS AAD PQ I RQFAAVLTRRRUTTRWRR LAAE QRE 3 
Li KS Li I IjTALQRB^I IKHCV S IiSIiAQI*S AT I FRKKOJjBAW PQLJjQLijb 
QHSTHSPHS PBRBMGLUiLSVVVTSRPEAFQPHHREIiIkRLLNET 
IXJRVGSPGLLPYSLRTLTTMAPYLSTEW 
TLIPIDEAKACEALBALDELIJ&EV^ 

NVALGNAJR IRI LCCLTFLVKVKS KALiLKNRUATLAAHPFPHC 
GC 


6601 


529 


1420 


PRAAARAP P PAVLRR DRRAATAPGAGEMTLHGPIAQRYFI^ 

KlTTWQDPRKAMNQPLNHMNUJPAVS3TP^ 

^QHQQQMAPS TLSQQNHPTQNP P AGLMSM PNAIiTTQQQQQQKL 

RLQRIQMBRERIRMRQEEIJIRQEAAIjCntOLPMEAETIiAPVQAAV 

NpplTm^MRSITNNSSDPKbNGGPYHSREQSTDSGIiGLGCYSV 

PTTPEDFXSl^EMDTGENAGQ-rPMNINPQQTRKPDPXIX^PGT 

NVDIiGTI.BSEDLIPl,KNDVSSAIJ)IKSRPPLTWL 


6602 


127 


617 


IJ^FPALPKWIAQSPKAGKP31>1TSMTQS1»REVI KAKT KARNF 
HRVI/JKJTLVSAAPGKVICBMKVEEEHTNAIGTLHGGLT^ 
NI STI4ALLCTERGAPGVSVDMNI TYMS PAKLGEDI VT XRHVLKQ 
GKTLAFTSVDLTNKATG KL I AQGRHTKHLCN 


6603 


79 


660 


PVGPSSIAARTGLGHLPFlflRIiASSRGLr^ 

SGMGATGTLRTSLDPSLEIYKKMFEVKRREQL^^ 

HQQYKILDVMLKGLFKVLEDSRTVLTAADVLPDGPFPQDEKLKD 

AFSHVVKNTAFFGDWLRFPRIVIIYYFDHNSNV?NLIiIRWGISPX: 

NQTGVPNQGPHSPILSLM 


6604 


3 


688 


TSTAQRQGGERMS FRGGGRGGFNRGGGGGGFNRGGSSNHFRGGG 
GGGGGGNFRGGGRGGFGRGGGRGGFNKGQDG^PPERVVLLGEFL 
HPCEDDl^CKCTTDEl^PYFtJAPVYIiENKEQIGinraEI FGQLR 
DFYFSVKLSENMKASS FKKIiQKFYIDPYKLLPLQRPLPRPPGEK 
GPPRGGGRGGRGGGRGGGGRGGGRGGGFRGGRGGGGGGFRGGRG 
GGFRGRGH 


6605 


7 


348 


SGSRRGAMRAAGVGLVIXJHCHLSAPDFDRDI^ 

AI»VAVABH8GBPKIttMQISERYNGFVI^^ 

SVTLKDLDVALPI IENYKDRLLAIGEVGLDFSPRFAGTGEQKEE 

ORO^IRQlQIAKRIiNLPVNVHSRSAGRTO 

HAFDGRPSVAMEGVRAGYFFSIPPSI IRSGQQKLVKQLPLTSIC 

LBTDSPAIX5PFKQVR^^WNIStSABYIAQVKGISVKEVIBVTT 

QNALKuFPKTJMLLQK 


6606 


2 


1682 


K8LGL lEGYGGRGKGGLPATIiSPAKE e kakgphekygyns ylsk 

KISI*DR3IPDYRPTKCKBLKYSla)LPQISirrc 

SVHSAVNHTPTHUiI^IILVDDNSDBBEIjKVPLEEYVHKRYPGL 

VKWRNQKREGL IRARI BGWKVATGQVTCFF1JAHVBFTAGWAEP 

V1»SRIQENRKRVILPSIDNIKQDNFEVQRYENSAEGYSWELWCM 

YISPPHIJWWDAGDPSIJPIRTPAMIG^FVVNRKFTGBIGLLDPG 

MDVYGGENIBIX3IKWlXGGSMBVLPC^RVAHIrmKXKPYNSNI 

GFYTKimALRV7uWWHDDyr^HVYIAWNLPLENP<3IDIGDVSER 

RALRKSLKCKNFQWYLDHVYPEi^YNN^ 

l£KX5PLEimTAILYPCHGWGPQLra 

DTRCLVDNS KSRLPQLJjIXZUKVKSS LYKRWNFI QNGAIMNKGTG 
RCLF^KNRGLAGIDLILRSCTGQRWTIKNSlK 


6607 


137 


986 


VPA<^LlttKW*SIa^PPRIu^ 








KIEDELBMTMVCHRPEGLF^LEAQT^^ 
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SBQ 
ID 
NO: 


Predicted 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

ami rin af**tfl 

sequence 


Predicted end 
m i r»T eofc ■? rif* 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 

\A«/uaiuuc F v»—v*ys uu-Ulc t U^rlB^Jat Liu MCJLUf B— 

Glutamic Acid, F= Phenyl alanine , G=Glycine, 
H=Histidine, I=lsoleucine, Kt=I»ysine, 
^Leucine, M^Methionine, N=Asparagine, 
P= P rol ine , Q-Gl ut amine , R=Arginine , 
S^Serine, T=Threonine, V-Valine, 
W^Tryptophan, Y-Tyrosine, X»Onknown, *=Stop 

\.uuuii| f "^JoaaiDAe nuncot iqc tit.it. uj.(_>ii, 
Vpossible nucleotide insertion) 








SGWNEDTFKQIYAQFFPHGDASTYAHYLFNAFDTIViXJSVKFE 
DFVTAI*S ILLIWSIVHBFCIjRWTFNLYD INKDGYINQEEMMD IVKA 

TvnMMPirvinrDrnT mrwvTTi'Of\il\i iiirpp/^lTMTMrwnrTvnifT tvqctt d 
JLiummrKX 1 1 1» viiKKITI J*K(JHV£>V Jr JrUKPVDluXllLDbXVXuDJSffliB 

SCQEDDN I MRSLQLFQNVM 


6608 


224 


1140 


RPCPSSPTGLCPRLSYPM1LLQHAVI*PPPKQPSP5PPMSVATRS 
TGTLQL PPQKP FGQEAS h PIAGBRBl^ KGGEQDCALBBIjCXPIjY 
CKLCNVTIjNSAOOAQAHYQGKNHGKXI^ 

wbeaatpwp vppqmgs fkpggrvi latendycklcdasfssp 
AVAQAHYQGKNHAKRLRIiAEAQSNSFSESSELGORRARKEGNEF 
KMMPNRRNMYTVQNNSGP YFNPRSRQR I PRDLAMCVTP SGQ FYC 
SMaHVGAGBEMEFROjmESKQHKSKVSEQRYRN^^ 


6609 


1 


443 


FRLRCRRFRVAGGRIAQAGLRKSRVPAPKQRLSALTLLSWSAVT 
PAAEPGNFQL3 PAEPRGPLASPVRAAPRAPCPAAEMSELNTKTS 
P ATN QAAGQBE KGKAGNVKKAKE EE E I DI0LTAPETEKAAJLAIQ 
GKFRRFQKRKKDPSS 


6610 


319 


881 


GRKS L CNLH I P I RFPLTYPDM YMGMM CTAKKCG I R FQPPAI ILI 
YESK I KGKIRQRIMPVRNFSKFSDCTRAAEQLKNNPRHKS YTjKQ 
VSLRQLEKLFS FLRGYLSGQS I^ETMEQIQRETTIDPEEDLNKIj 
DDRBIJ^KSlNDBI^KNQKKKDDPWVm^IBVBFPQDDQIiQS 
CGWDTE3ADEF 


6611 


978 


212 


pgcsgagsrvwwiipalrhliamgstes segrrvs fgvdeeer vrv 
i^vrlsfjwvnrmkepsspppaptsstfgujixjn^raph^ 

LPRSGSSGGO^PSGMKEGVKRYEQBIOVAIQDKIiFQVAKREREAA 
TKHSKASLPTGEGSISHBEQKSVRLARELESREABLRRRDTFYK 
EQLERIERKNAEMYKLSSEQFHEAASKMESriKPRRVEPVCSGli 
QAQ I UICYRDRPHEVIiIj CSDL VKAYQRCVSAAHKG 


6612 


1724 


992 


VSTHASALSRTQGQPOROPRAAASGAGAGTAGGGGSGGAEGSKM 
STEAQRVDDSPSTSGGSSDGDQRKSVQQEPBRBQVQPKKKEGKI 
SSKTAAKLSTSAKR IQKELAE I TIJDPP PNCSAGPKGDNI YE WRS 
TI LGPPGS VYRGGVFFLD I TF3 PDY PFKPPKVTFRTR I YHCNTN 
SQGVI CLDILKDNWSPALTISKVLLS I CSLLTDCNPADPLVGS I 
ATQYMTNRAEHDRMARQWTKRYAT 


*6l3 


130 


748 


ELBLSSNMPEQSNDYRVAVFGAGGVGKSSLVTjRFVKGTFRESYI 
PTVEDT YRQVI SCDKS X(Tri^I T17ITGSHuFPAMQRI>S ISKGHA 
FIIiVYSlTSRQSLEHIiKPIYEQICEI KGDV3SIPIMLVGNKCDE 
SPSRBVQSSFAFAI^TWKCAFMETSAKIiNHNVKELFQEIJ^^ 
K^RTVSLQIDGKKSKQQKRKEKIJCGKCVIM 


6614 


3 


1191 


SSAAEANmVLVRRCWGPPI^GARRGRPSPQWIUU^ARIiGWBDCR 

DSRVREKPPWRVI»FFGTDQFARBALRALHAARBNKfiSELI0KIjE 

WTMPS PS PKGLPVKQYAVQS QLPVYEWPDVGSGEYDVGWAS F 

GRIJifEAIiIUrpPYGTLNVHPSajPRWRGPAW 

GVTIMQ IR PKRFDVGPILKQETVPVPPKSTAKEIfE AVIiSRI/3AN 

MLISVLKNLPESLSNGRQQPMEGATYAPKISAOTSCIKWEEQTS 

EQIFRLYRAIGNI IPIiQTLWMANTIKI^LVEVNSSVIJVDPKLT 

GQALI PGSVrtfHKQSQI LLVYCKDGW I GVRS VMLKKSLTAXDFY 

NGYLHPWYQKNSQAQPSQCRFQTIJiLPTKKKQKKTVAMQQCIE 


6615 


B32 


35 


GRVGAGASAMSEI*PGDVRAFLRBHPSLRI>QTDARKVRCIIiTGHE 
LPCRLPELQVYTRGKKYORLVRASPAFDYAEFEPHIVPSTKNPH 
QLFCKLTLRHINKCPEHVIJU^^ 

VPACTjVHRRRRRBDQMDGDGPRPREAFWE PTSSDEGGAASDnSM 
TDLYPPI^FTRKDIXJSTEIXaDGTDDFIiTO 

EGRRETTVYRGLVQKRGKKQLGSLKKKFKSHHRKPKSFSSCKQS 
G 


6616 


347 


1886 


LLPPCQGARPLSSPPHASEDNLFLFWNCILCAFPHPSPQPLQYP 
VWPLLLVTTQIPAPRHLRNRPFSFSRGGLDSFSGSLSTPSICRS 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C= Cysteine, D^Aspartic Acid, E~ 
Glutamic Acid, F^Phenylalanine, G^Olycine, 
H=Histidine, I=Isoleucine, K«I»ysine, 
L=Leucine, M^Methionine, N^Asparagine, 
P^Proline, Q -Glut amine, R»Arginine, 
S= Serine, IVThreonine, V»Valine, 
W=Tryptoplnanf YaTyxoflme, Unknown , *»Stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 








PAV^MAPWPPKCJIiVPAVLWGI^nFJiNLPGPIWLQPSPPPQSSP 
PPQPHPCHTCRGL VDS FNKGLERTIRDNFGGGNTAWE EEMLS KY 
imooTQT.xr e*ijt wr , \mcvt!T\wnooT T.PT.oppr in?cuui?tnronca 

PDLFQWLCSDSLKLCCPAGTFGPS CXPCPGGTRRPCGGYGQCEG 
EGTi^GSGHCDCOAGYGQFJ^CGQCGI^YFKAERNASHLVCSACF 
GPCAKC5GPEE\SNCl^CKKGWALHHIiKCVDIDECGTEGANCGAD 
QFCVNTEGSYJSC3a)CAiCACIiGCf^ 

LDVDECETEVCPGEN KQCKNTEGGYRCI CAKGYKQMRGlCVKEQ 
IPESAGFFSEWEDELWLQQMFFG1IICALATLAAJCGDI.VFTA 
I PIGAVAAMTGY WLS ERSDRVLEGFIKGR 




118 


673 


WKAWQVSIiBLBDRl*QCPICIiEVFKBSLMI^XX3IlSYCKGGiVS 
I»3YHLDTKVRC^MCWQAVDGSS3LPNVSIiAWVIEA1jRLPGDPBP 
KVCVHHRNPI*SLFCEKDQEIi I CGLCGI»LG5 hqhhpvtp istvcs 
RMKEELAMiFSEI»KQEQKJCVDELIAKLVKNRT^ 
LGPATFTFIi 


6618 


548 




DGIO/ARRAPNSPAFQITOIYPLVSAPRATTAESPW 
NVPKMTSRRSRI P CLE AARAEGTGKKQQEGRAMATUDRKVPS P E 
AFl^KPWCGWinAAKLHCGDNVDLEIiAGKRGGKSREVMKI^KEA 
WKYGT 


6613 


246 


842 


PASSHVLTAAVMFl,LI^CIVAVS0N^IGK3^GDIiPRPPI>RNEFR 
YFQRMTTTSSVE1GKQNI»V IMGRKTiffFS I PK KNRPLKDtt I NGVLS 
RELKRPPQGAHFLARSI .DnALKLTERPEUOJKVDMTWIVGGSSV 
YKiSAMNHLGHLKLFVTRIMQDFESDT FFSE IDLBKYKLLPE YPG 
IliSDVQEGKHIKYKFEVCKKDD 


6620 


3 


1879 


NSRVDDFVARARMAABNEASQBSALGAYSPVDYMSITSFPRIiPE 

DEPAPAAP^GRKDEDAFIjGDPDTDPDSFLKSARIiQRLPSSSSE 

MGSQIXJSPLRBTRKDPFSAAAAECSCRODGbTVIVTACIjTFATG 

VTWALVMQlYFGTOPQIFWGAVVTnAARCT 

BAAVAAALCIJGI VAPR S SGLGGGGVMLVHDIRRNBSHLI DFRES 

APGALREETLQRS WETKPGliLVGVPGMVKGIiHEAHQLYGRLPWS 

QVl^AAAVAQIX^ FI^1IDLARAI>AKQL P PNMS ERFRE TFL PS G 

RP PL PGSLLHR PD1ABVLIXVLGTSGPAAFYAGGNLTLEMVAEAQ 

HAGGVITEEDFSNYSALVEKPVCGVYRGHLVLS PPPPHTGPAL I 

SALNILEGFNLTSLVSREQALHWAmJCIAIALASRI^DPVYP 

AAQVXjIMGPDDFIVAMVS^IiNQPFGSGLITPSGIIJtjHSQMLDPS 
WPNRTANHSAPSLRNSVQPGKRPLSPLLPTVVRPAEGLCGTYLA 
LGANGAARGLSGLTQVRFTPWIiAFFSREPS CGLDCRCLS YLWLV 
SIPHAANHG 


6621 


1 


662 


VQGITSYO^RiQALRKEKSRDAARSRRGKE^ 
AAITSQLDKAS I IRLTI SYLKMRDFANQGDPPWNGRMBGPPPNT 
SVK1/IGAQRRRSPSAIAIBVFKAHLGSHIIX3SIJXWFALNQEG 
KFLYISETVS IYLGI^QVFJjTGSSVPDYVHPGDHVEMABQIjGMK 
LPPGRGIjLSQGTAEDGASSASSSSQSETPEPVVCFPPASDQFIiL 


6622 


2 


319 


GRASG AQKETEAGGPERARAMBANMPKRKEPGRS lri kvxsmgn 
AEVGKS CI X KRYCEKRFVSKYIiATIGIDYGVTKVIWRDREIK^N 
IFDMAGHPFFYEVRKPF 


6623 


1886 


189 


KALFEKVKKFRLfTV^EGDILYAMYVRQTVjjKVIKFLI I IAYNSA 
IiVSK\^F*JVDCNVDIQDMTGryKlfFSCNHT^ FCYLCF 
VS I YGLTCLYTLYWLF YRSLRJBYS FEY VRQETGFDDI PDVKNDF 
AF^n^HMIDQYDPLYSKRFAVFI^BVSK^^KIJCQLNIJ^WTPDKL 
RQKI^TNAHNRLEXjPLI MLSGIjPDTVFE ITELQSI»KI>E I X KNVM 
I PATIAQLDNLQRLS LHGS^ WIH5 AAI^FLKENIiKVLS VKFDD 
MRELPPWMYGliRNLBELYLVGSLSHDISRNVTLESLRDIiKSLKI 
I^IKSWSKrPQAVVinrSSHriQKMClHliu^ 
IiTEI^VHCDJLERIPHAVFSLLSI^Et»DLKENNLKSIEEIVSFQ 
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nucleotide 
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to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D»Aspartic Acid, E=« 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, Idsoleucine, K= Lysine , 
I»=Leucine, M=Methionine, N^Asparagine , 
P=Proline, Q=Glutamine r R=Arginine, 
S=Serine, T=Threonine, V~Valiiie, 
W=Tryptophan, Y -Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








hlrkltvlklwhns ityipehikkltslerlsfshnkibvlpsh 
lfix^kiryldlsyndirfippkigvlqslqyfsitcnkveslp 
dklyfckklktucigknsi^vlspkiignllfi^yldgkx^fei 

LPPELGBCPJtfoKRAGLVVEDALFETLPSD^ 


6624 


218 


1786 


GS RRGGGS R I PAVS THVAPGRS VLRP FASCALRIJlSLVKAIiGGC 
RGRPSGIAHI^QETSHWRAKRSGRACLGDPPGEIIiRSFI^ 
RBWLRVTTVLFMARAI PAMVVPRATT t T>KTCTtriBKYMDEDGEWW IA 
KQRGKRAITDNEMQS IIiDLHNKliRS 

RSAESWAESCLWEHGPASLLPS IGQNLGAHWGRYRPPTFHVQSW 
YDEVKDFS YPYEHBCNP YC P FR CSGPVCTHYTQWWATSNR IGC 
AINLCHNMNII^IWPKAVYI,V^ 

ACPPSFGGGCRENLCTTKEGSDRYYPPRREBTNEIERQQSQVIIDT 
HVRTRSDDSSRNEVlSAOX}MSQ I VSOSVRLRP^ 
PAGCLDSKAKVIGSVHYEMQSS I CRAAIHYGI IDNDGGWVDITR 
QGRKHYFI KSNRNG I QTIGKYQSANSFTVSKVTVQAVTCETTVE 
QLCPFHKPASHCPRVYCPRKLYASKSTIiCS CNWNSSLP 


6625 


1124 


543 


PGPRGGGGSLLSTKALGRSRGLGMHPGPSSGGTEGGVPTALRPP 
GPIAJPSTSDDNUdOflBLFDKLAIJ^ 

SFYGQGRKlAEVCCTS I VYATEKKQTKVEFPEARIPBETIjNIIiI 
YBTPRGPDPAIjLEATGGAAGAGGAGRGEDEENREHRVRRIHVRR 
HITHDERPHGQQIVFKD 


6626 


3 


1498 


SAVEFVYTDRnilill/SISVEFliCSI^SimTMESITACLHAI^AL 
LDVPWPRSKJGSDCnDSGIEIOiNVLHRVILTRESPSIQLASLEVV 
RQI ICAAQEHVKEKRRSAEVDDGAAEKETLPEFGEGKDTGGIiVP 
GKS LVPATLELCVCILVRQLPELNPKIiTGSPGVKATKPOI lled 
GSRLVSAALVILSELPAVCSPEGS IS ILPTILYLTIGVLRETAV 
Kl»PGGQI^STVAASIiQALKGILSSPMARAEKSRTAWTDLI,RSAIi 
TT 1 LDCWDP VDETHQELDEVSLLTAI TVF I LS TS PBVTT I PCLQ 
KRC I DKFKATLE I KDPWQ I KTYQLIjHS I FQ YPNPAVS Y P Y I YS 
LASCIMBKLQKIDKRKPHNTAEIiEI FQBGIKVLETJjVTVAEEHH 
RAQLVACLIiPI LISFIiLDENSTiGSATS IMRNLHDFALQNLMQIG 
PQYS S VFESLVASSPALKARLEAAIKGNQBSVKVKIPTS KYTKS 
PGKNSSIQLKTSFL 


6627 


1 


697 


G IPHLSSRDMTGTPGAVATRDGKAPERSPPCS PS YDLTGKVMLL 

GDTGVGKTCFLIQFMX3AFIjSGTFIATVGIDFRN3KV^ 

KXQIWDTAGQBRFRSVTHAYYR13AQ^ 

WLTE IHEYAQRDWIMLLGNKADMS S BRVIRSEDGETLAREYGV 

PFkETSAKTGMNVELAFIJU^AKELKYRAG^ 

SQKKRSSCCSFM 


6628 


1 


1861 


QCAE FGGGSGGGGGSGGGGSGGGRGAGGEENKENERPSAGS KAN 
KEFGDSLSLB IIX5 1 1 KBSQCX3HGLRIIGDFQRYRG YCSRRQRRLR 
KTUffFKMGNRHKFTGKJRVTESIjLTDKIR 

QLKQEANTEPRKREmj*SRLRKAVKHAEEI^RLCBSNRVDAKTK 

LEAOAYTAYLSGMLRFKHQE WKAAIEAFNKCKT I YEKLASAFTE 

EQAVIiYNQRVEE ISPNIRYCAYNIGDQSAINELMQMRIiRSGGTE 

GLLAEKLEAl>ITO/TRAKOAATMSEVEWRGRTVPVJCJDKVRIFLL 

GIADNEAAlVQAESEETKFJiLPESMI^SCRDAIQVVREEL 

KQRDYILEGEPGKVSKLQYIiHSYLTYIKLSTAIKRNENMAKGLQ 

RALLOOQPEDDSKRSPRPODIjIRLYDIILQNLVELLQLPGIjEED 

KAFQKBIGLKTLVFKAYRCFFIAQSYVLVKKWSEALVLYDRVLK 

YANEVNSDAGAFKNSLKDLPDVQELITQVRSEKCSLQAAAILDA 

NDAHQTETSSSQVKDNKPLVERFETFCLDPSI 1 VTKQANLVHFPP 

GFQPlPClOPLFFDIJUiNWAFPPLEDKLEQKTKSGLTGYIKGIF 

GPRS 


6629 


5653 


4549 


GATPLGS VGGRTGKMDAATLTYDTLR PAKPBDFPBTSEFVW ILG 
RKYSIFTBKDEILSDVASRLWFTYRKNPPAIGGTGPTStrrGWGC 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine , C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F==Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K^Lysine, 
h= Leucine , M=Methionine , M^Asparagine , 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T- Threonine , V=Valine, 
W- Tryptophan, Y^Tyrosine, X-Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








MLRCGQMIFAQALVCRHLGRDWRWTQRECRQPDSYFSVI.NAFIDR 
KDS YYS IHQIAQMGVGBGKS IGQWYGPNTVAQVLKKLAVFDl'WS 
S LA VH I AMDNTWME KIRK I,CRTS V? CAGATA FP ADS DRH CNC F 
PAGAKVTNRPS P WRPLVLLI PLRLGLTDINEAYVKTLKHCPMMP 
QSKmGGKPNSAHYPIGWGBELIYliDPHTTQPAVBPTDGCFI 
P DBS FHCQE P PCRMS I AF. LDPS IAVVRGGHT>STQAFGAE CCLGM 
TRKTFX5FLRFFFSMLG 


6630 


2 


423 


LVQCGG IRRRS AWGAMPGRHVSR VRAIj YKRVLQLHRVLP PDIiKS 
I^DQYVKDEFRRlIKTVGSDBAQRFI^KWKVYAl'AIiUX^ANRNRQ 
NSTGKACFGTFLPEEKliNDFRDEQIGQI^EL^EATKPNRQPSI 
SBSMKPKF 


6631 


2 


423 


LVQCX^IRRRSAWGAMPGRHVSRVRAXiyKRVLQLHRVLPPDriKS 
LGDQYVKDEFRRHKTVG3DEAQR PT^QEWEVYATAI^LQQANEJJRQ 
NS TGKACFGTFI*? BEKLNDFRDKQ IGQLQELMQEATKPNRQ FS I 
SBSMKPKF 


6632 


X273 


588 


WNSRGRTORGAAPLAPAAAMKAWQRVTRASVTVGGEQISAIGR 
GICVIJ^ISLEDTQKKLEHMVRKXLI^^ 

KQYEILCVS^PT£iQCVLKG17KPDP1QjAMPTBQAEGF¥NS pleql 
RKTYliPBLIKIXSKFGAYMQVHIGJWDGPVTIELRS PAPGTATSDP 
KQLS KLEKQQQRKEKTRAKGPSESS KERNTPRKEDRS A3 SGABG 
DVSSEREP 


6633 


1145 


617 


ATGRHEG VPTLKG I IQQLVNGHTPATIPSLGPWGVbHSNPMDY 
AWGANGIJ3AI I TQLLNQF^lTrGPPP ADKEKIQALPTVFVTE EHV 
GSGLECPVCKDDYAIjGERVRQI^CNHLFHDGC IVPHLKQHDSCP 
VCRKSLTGOOTATNPPGLTGVSFSSSSSSSSSSSPSNENATSNS 


6634 


1 


1134 


CGG I PRKG^PRRRLPMARUU)CI,PRIJ1IiTI^IJ J FWSI,VYCYC 
GLCASIHI^LKIiLWGLGKGPAQTFRRPAREKOPPACLSDPSIjGTHC 
YVRlKDSGLRPHYVAAGBRGKPLMLLLHGFPEFWYSWRYQIiREF 
KSBYRVVALDLRGYGETDAP IHRQNYKLDCLITDI KDXLDSDG Y 
SKCVLlGHDWGGMIAWLIAICnn?EMVMK^ 

T.RHPAQLUCSSYYYFFQIPWFPEFMFS INDFKVLKHL.FTSHS TG 
IGR5G3CQLTTKDLEAYIYVFSQPGALSGPINHYRNI PSCLPIJCH 
HMVTTPTLLLWGFJ'JDAFMEVBMAB^ 
WLQQDQPDIVNKLIWTFLKBHTRKKD 


6635 * 


1420 


470 


BMRAGQQLASWIiRWTRAHRLPREGIiGPHGPSFARVPVAPSSSSG 
GRGGAEPRPLPIiS YRIiUDGEAAEiPAVVFIiHGLFGS KTNFNS I AK 
IIJlG^JTGRRVLTTOARNHGDSPHSPDMS 

LVPCSATVGHSMGGKTAHLLALQRPELVFJiLXAVD^ PVESTGVS 

Hl^ATYVAAMRAINIADFXPRSRARiUj^ 

LT^VEVIX5RFVWRVNU1&LTQ^ 

LGGNSQFVHPSHHPEU^FPPJ^QMQTVPNAGHHIHADRPQDFI 
AA1RGFLV 


6636 


1514 


1801 


S FCMFSHKQDSHFQAVPVQEKiaCRliRRAPWRAFAQPQRLKHPAB 
QPIVROXLQRPPI>CGVLGPVQO^LPPSI^VLSPHSDPXWCH.VD 
DGGDGVF 


6637 


2 


1501 


CSSSPCFHJXTTC^DKAGSYKCACajVGYTGORCENI^ 

KASBDSLSVLEFJWCSDPGGPVNGYQKITGGPGLINGRI1AKIGT 

WSFFCNNSYVLSGNEKRTCQQNGEWSGKQPICIKACREPKISD 

LVRRRVL PMQVQSRETPLHQL YSAAFS RQKLO^APTKKPAIjP FG 

DLPMGYOjnjHTQLQYKO^PFYRRI^SSRRTCLRTGKWSGRAPS 

CIPITOKIENITAPKTOGLRWPWQAAJYRRTSGVHDGSLHKGAW 

FLVCS GALVNERTVVVAAHC^rroLGKVTM I KTADLK WDGKF YR 

DDEIRDEKTIC^LQISAJCIIiHPNYDPIIiDADIAlLKI^ 

TR VQP I CLAASRDLSTS FQESII I TVAGWNVLADVRS PGFKNDTL 

RSGWSWDSLLCBBQHEDHGI PVSVTDNMFCASWEPTAPSDIC 

TAETGGlAAVSFPGRASPEPRWHLMGLVSNSYDKTCSIIRIiSTAF 
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amino acid 
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amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A»Alanine, C=Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, Islsoleucine, K^Lysine, 
L=Leucine, M=Methionine, M=A3paragine, 
P= Proline, Q^Glutamine, R=Arginine, 
s=Serine, T=Threonine, VaValine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\~possible nucleotide insertion) 








TKVLPFKDWIERNMK 


£638 


1391 


224 


GGIPQAGGKMAAPWWRAALCECRRHRGFSTSAVLGRRTPPLGPM 

PNSDIDLSNLERLBKYRSFDRYRRRAEQEAQAPHWWRTYREYFG 

EKTDPKEKIDIGLPPPKVSRTQQLLKRKQAIQKLRANVEBKRAA 

RLRTASVPLDAVRAEWBRTCXpPYHKQRIjAEYYGL^ 

VPRVPLHVAYAVGKIX)IWPVYCCNEVTPTEAAQAPBVTTEAEEG 

SLWTI^LTSLDGHLIsRPXiAEYLHWLLTNIPGNRVAE^VTCpyL 

PPFPARGSGIHRIAFliLFKQDQPIDFSEDARPSPCYQlAOkTFR 

TFDrcKKHQETMTPAGLSFFQOTWDDS^^ 

FVRPPPYHPKQKRFPHRQPI^YLDRYRDSHBPTYGIY 




■ 2046" ' 


" 1268 


"IGCFIMDGGDDGNLI IKKRFVsfeAElJJERRKRRQEEWEKVRKPB " 
DPEECTBEVYPPI^LYBm^EQKDRKO^EYBEQFTCFKNMVRGLD 
EDETNPLDEVSRQQELIEKOjiREBEIjKELKKYRNNLKKVGI SQB 
NKKBVEKKLTVKP IBTKNKFSQAKI»IiAGAVKHKSSESGNSVKRL 
KPDPEPDDKNQEPSSCKSLGNTSIiSGPSIHCPSAAVCIGIbPGL 
GAYSGSSDSBSSSDSEGTINATGKI VSSI FRTNTPLEAP 


6640 


117 


1043 


VI^PPDVSMAESEDRSIiRIVl*VGKTGSGKSATANTILGEEIFDS 
RIAAQAVTKNCQKASREWQGRI)LLVV^ 

ISRCIISSCPGPHAIVLVLIJiGRYTEEEQKTVALIKAVPGKSAM 
KHMVIIiFTRKEEIiEG^ FHDPI ADADVGI»KS IVKECGNRCCAFS 
NSKKTSKAEJCESQVQEI»VEI»IBTO1VQCN^^ 
r J KQREEVLRKIYTDQT«NEE I KLVBEDKHKSEEKKEKKI KLLKLK 
YDBKIKNIREEAERNIPKDVT^RIWKMLSEIWHRFLSKCKPYSS 


6641 


1 


894 


SAAVGRRSEWGCAPRPRLRRSAIUU4DPVPGTDSAPIJ\GIJ\WSS 
ASAP P PRGPSA I SCTVEGAPASFGKS FAQKSG YFLCLSSLGS LB 
NPQENWADIQIWDKS PLPIjGFSPVCDPMDSKASVSKKKRMCV 
KLL PLGATDTAVFD VRI>SGKTKTVPG YLRIGDMGGFAI WCKKAK 
APRPVPKPRGLSRDMQGLSLEAASQPSKGGIjIiERTASRIjGSRAS 
TIJ^RNDSIYEASSLYGISAMDGVPFTIjHPRFEGKS^PIAFSAF 
GDLTIKSIJu^IBEJBYNYGFVVEKIAAARIiPPSVS 


6642 


22 


1296 


PLEERMMTKMDPNDQAQRDI I FELRRIAFDAESDPSNAPGSGTB 

KRKAKTTKDYKMLGFTKHINPAM)FTQTPPGMLA 

HODTYIRIVI^BNSSREDKHBCPFGRSAIEr.TKMLCBILQVGELP 

NEGRm)YIIPMFFTHDRAFEBLPGICIQI^K^WKEMRATAEDFN 

KVMQWR EQI TRALPSKPNSLDQEKS KLRSLS YS B ILRliRQS ER 

MSQDDFQSPPIVEIJ^IQPEILELIKQQRIiNRIiCBGSSFTlKlG 

NRRRQJ^FWYCRLALNHKyiiHYGDI^^ 

ADIKAIVTGKDCPHMKEKSALKQNKE VLB LAPS ILYDPDETLNF 
!APNKYEYCIWI]X5LSALLGKDMSSELTKSDUmJ^MEMKl^ 
LDLENIQIPEAPPPIPKBPSSYDFVYHYG 


6643 


3049 


2265 


SLHAPAEGRTRGRIiAEKPKMLXRKJKIjTOIHAI1ITCRLCSGYI>I 
DATTVTECIiHTFC^SCIjVKYIjBENNTCPTCRIVIH 

hdrtmqdi vyklvpgi^kaemrkqrbfyhki/smevpgdi kgetc 
sakqhi^shrngbtkaddssnkbaabbkpeedndyhrsdeqvsi 

CLECNSSKXJIGLKRKWIRCSAQATVIiHLKKFIAKK^ 
DILCNEEILGKDHTTiKFVVVTRWRF^^ 


6644 


1489 


290 


PRPIATEP RGSS P VQLVSSTMSVRTLPLLFI^fLGGEMLY I LDQR 

lraqn i pgdkarkvlnd 1 i stmfl*rkfmebl»fkpqel>ys kkalr 
tvyerlahas imklnqasmd klydlttrmafkyqvllcprpkdvl 
lvtfnhldtikgfirdspti lcxtvdetlrqiite i ygglsagefq 
lirqtiiliffqdlhirvsmflkdkvqnwngrfviipvsgpvpwgt 
evpglirmfnnkgebvkriefkhggnyvpapxegsfefygdrvl 
klgtnm ysvnqpvethvsgs sknlaswtqbs i apnplakeblnf 
iiarlmggmeikkpsgpepgprlnlfttdeeeeoaaltrpeblsy 
evin iqatqdqqrseblar imgefe iteqprlsts rgddijiamm 

DEL 
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ID 
NO: 
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beginning 
nucleotide 
location 
corre sponding 
bo first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A=Alanine, C*=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine , G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine , 
L^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R-Arginine, 
S=Serine, T=*Threonine , V=Valine, 
W^Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6645 


} 6530 


4646 


FVEGLAGYVYKAASEGKVLTLAA^ 

GQRSTPIil IAARNGHAKVVRLLI^HYRVO/rQQTGTV^ 

GATALWCAAGAGHFEVVKIjLVSHGANVNHTTVTNSTP 

GRLDIVKYLVENNRNI S I ANKYDNTCI^IAAYKGHTDWRYLLE 

QIUuDPNAKAHCGATALHFAAKAGHIDIVKKLI kwraaxwnghg 

WPLKWUffiSCKADVVKLLI^SHADCDimSRIETV^^ 

RKNYD 1 1 ICTYHYL YIAMLBRFQDGDNI LB KEVLPPIHAYGNRTK 

OUJPQELESIRQDMJAIiHMEGLIVRERILGADNIDVSHPIIYRG 

AVYADNMB PKQCI KLWLHALHLRQKGNRNTOKDIJjRFAQVFSQM 

IHLNETVXAPDIECVLRCSVLEIEQSMI^^ 


YECNLYTFLYLVCISTKTQCS BEDQCKINKQ I YNL IHLDPRTRE 
GFTLUO^WSNTPVDDPHTNDVCSFPNALV^ 
VDNBGNSALHI IVQYNRJP I SDFLTLHS 1 1 I S LVEAGAHTDMTNK 
QMKTPLDKSTTGVSEILLKTOMKMSLKCIAARAVRA1TOINYQDQ 
IPRTLBEFVGFH 


6646 


176 


890 


PSSRMNHIJ>BDMENALTGSp^SHASIdW^ 

EGREKKGI SDVRRTFCLFVTFDLLFVTLLHI IELNVNGGIKNTI* 

KKE VMQYDY YS S YFD I FLLAVFR FKVLI LAYAVCRLRHWWAIAL 

TTAVTSAFLlAIOT:i»Sia^SQGAFGYVLPIISPILAWIKTWPLD 

PKVLP<3EAE EENRLLI VQEASBRAAIil PGGLSDGQFYS PPESEA 

GSEEABEKQDSEKPLLEL 


6647 


176 


890 


PSSRMNHLPEDMENALTGSQSSHASriRNIHSINPTQLMARIESY 
EGRE KKGI SDVRRT FCLFVTFDLL FVTLLWI I ELNWGG I ENTI» 
EKEVMQYDYYSSYFDIFLIiAVFRFKVLILAmVCRLRHWWAIAL, 
TTAVTSAFLLAKVILSKIiFSQGAFGYVLPI ISFILAWIETWFLD 
FKVLPQEAEFJ^RLLIVQDASERAALIPGGLSDGQPYSPPESEA 
GSEEAEE KQDS EKPLLEL 


6648 


413 


897 


RN CWNCFTKY FNS PPED I DHKDS YL ITRS I MAEPDY I BDDNPEJb 
IRPQKLiDreVKTSRNHQDIdiRBIiLt^ 

KRKRDQVI KQKEEEAQKKKSDLE IELLKRQQKLEQIiELE KQKLQ 
BEQKNAPEFVKVKGNLRRTGQEVAQAQES 


6649 


1357 


832 


WIPRAAGIRKJBVKWDVKEIMSO#NIYVDAIi^ 
SKRVRIPLPVSNILWEHCIRLANRTIVEGYANVKKCSI^GPJU>1 
QliDFO^FLMKLKKLTDIRPI PDKEFVETYIKAYYLTENDMERWI 
KFJiPJEYSTKQI.TNLWWCX^ 


6650 


32 


765 


LVPLVFSIiVQSCSCQVryRSIAMKFVPCI^VTM 

QKO^SlX^FHFQTGGRrscrMRPSSW30G^ 

OrYWCEYRGQPSWCQAFAADPKSYWNO^ 

I^PSVCREAGPOAHMQQVTSSLKGSPKPNQQPEAGTPSIiRPKAT 

\^TEATQI^KDSMEELGKAKPTTRPTAKPTQPGPRPGGNEEAK 

KKAWEHCWKPFQALCAFLISFFRG 


6651 


3425 


1353 


AKKLLKTODFSLC^GP YO^Am^KNLSKEPIiAS FVSES FDISA 
CGIATEHVKIDNS GEGLTAEAGSETLS RDGEVGVNSDMHYEI*SG 
DSDIJ)LLGDCRNPRI^LEDSYTIiRGSYTRiQa)VP^ 
HNNNQEDWGCSSWVPGMBTSIjPPGHWTAAVKKEEKCA/PP yvqi r 
DLHG I LRTYAN FS ITKELKDTMRTSHGLRRHPS FSANCGL P S SW 
TSTWQVADDLTQNTLDLEYLR FAHKLKQTI KNGDSQHSAS S ANV 
FPKESPTQISIGAFPSTKISFJVPFIiK^APRSRSPLLVTVVESDP 
PJWQPRRGYTASSLDSSSSWP^CSHNRDI^SQRNHTVSFH^ 
NKLKJfNSTVKBSRNDISLIIiNBYAEFNK^ 
DVSGEATAQEMYLPFPGRSAS YEDI I IDVCTNIJttVKLRSVVKEA 
CKBTFLFYLVETEDKSFFVRTKNIiLRKGGHTE IEPQHFCQAFIIR 
ENOTLIII IRNEDISSHIilQIPSIiKLKHFPSVIFAGVDSPGDV 
IJDHTYQELFRAGGFVISDDKILKAVTLVQLKEI IKILEKLNGNG 
RWKWLLHYRENKKIJCEDBRVDSTAHK^^ 
raQCDSRSSTKAEIIJCCLI^QrQIIII^FAVIXTDKPTIPREV 
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ID 
NO: 
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beginning 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A~Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid. ^Phenylalanine , (^Glycine, 
H=Histidine, I=Isoleucine, K=rLysine, 
L=Leucine, M=Methionine, ifeAsparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Thxeonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FBNNGILVTDVNNFIENIEKIAAPPRSSYW 


6652 


2 


1343 


I PGSTISCSCHSRRLRGGSPAPRLSLGAASPRPRPPSLPLPLPL 
PPPLPLPTRPAERAWIRSRRASKWVGKMBVPRIJDHALKSPTSPC 
EBVIKNLSLKAIQLCDRDGNKSQDSGIABMEBLPVPHNIKISNI 
TCDS PKIS WEMDS KS KDR1THYPIDLNKKENKNSNKFKH1CDVPT 
KLVAKAVPLPMTVRGHWFLS PRTByTVAVQTASKQVDGDYWSK 
WSEI IBFCTADYSKVHLTQIiLEKAEVIAGRMLKPSVFYRNOTKE 
YFD YVRBHHGMAMQ PS VKDNSGSHGSPI SGKLBGIPFSCSTBFN 
TGKPPQDS P YGRYRFEI AAEKLFNPNTNLYFGDFYCMYTAYHYV 
ILVIAPVGS PGDEPCKQRLPQLNSKOMKPLTCTISEDGVLVYHHA 
QDVI LEVT YTD PVDLS LGTVAE ITGHQLMS LSTANAKKD PSCKT 
CNISVGR 


6653 

i 


170 


1910 


FFLEPRLRPFPASRARPVPARTRPSPLHPCCFCFEGGGSMLSPQ 
R VAAAAS RGADDAME S S KPG PVQWLVQKDQ} I3FEKDK KALiAS I 
LLQDHIRDLDVVWSVAGAFRKGKSPIIJJPMLRYI,YSQKESGHS 
NWLGDPEB PLTGPS WRGGSDPETTGIQI WSEVFTVEKPGGKKVA 
VVLMmXK^AFDSQSTVKDCATIFALS'^ 

DI^LQLFTEYGRLAMDEIFQKPPQTLMPLVRDWSFPYKYSYGIi 

CK^MAPXDKRI^VKJBHQHEEIQNVPJ^HIHSCFSDVTCFIiLPHPG 

IX>VATSPDFIX3KLKI>IAGEFKEQLQALIP^ 

SKVTCRGLLEYFKAYIKIYQGKDIiPHPKSMIXJATAEAYNLAAAA 

SAKDIYYNNMEBVCGGEKPYTiSPDILEEKHCKFKQIiALDHFKKT 

KKMGGKDFSFRYQQELEEEIKELYENFCKHNGSKNVFSTFRTPA 

VLFTG I VAL YI ASGIiTGFIGLE VVAQLFNCH VGLLL IALLTWG Y 

IRYSGQYRELGGAIDFGAAYVI3QASSHIGNSTQATVPJ3AVVGR 

PSMDKKAQ 


6654 


1 


705 


RTSLSPSQCSSFNIjAMASAGMQII^VVIjTLLGm^NGIjVSCALPNJ 
WKVTAFIGNS IVVAQVVWEGLWMSCW^TGQMQCKVYDSLIiAL 
PQDLQAARAIj CTIAIiLYALFGLLVYLAGAKCTrTCVKE KDS KARL 
VLTSGIVFVISGVLTLIPVCWTAHAVTRDPYNPLVAEAQKRBLG 
ASLYI^WAASGLLLLGGGIJ^CTCPSGGSQGPSHYMARYSTSAP 
AISRGPSEYPTKNYV 


6655 


341 


16 


KI^YMFKKGIJ^U^VFSLPVPAABHWIDVRVPEQYQQEHVG<3A 
IN I PLKEVKER I ATAVPDKNDTVKVYCNA^ 
YTHVBNAGGIiKDIAMPKVKG 


6656 


2 


1212 


TBIjPPRPANLAIQPPI^PIiRAIjAPLPEKPGAVPPPOKRMAKVAK 

DLNPGVKKMS IX^IiQSARGVACLGCKGTCSGFE PHS WRK I CKSC 

KCSQEDHCLTSDI^DDRKIGRUJ^DSKYSTLTARVKGGDGIRIY 

KRNRMIMTNPlATGKDPTPTrmnnSfDl^ 

XE KQPVTGTEGAFYRRRQLMHQLP iydqdpsrcrgllenelklm 

EEFVKQYKSEALGVGBVALPGQGGI^KEEGKQQBKPEGABTT^ 

TTNGSLSDPSKBVEYVOTLCKGAAPPDSPVVYSDRAGYNKQWHP 

TCFVCAKCSEPLVDIilYFWKDGAPWCGRHYCESLRPRCSGCDEI 

I FAED YQRVEDLAWHRKHFVCEGCEQ JjbSGRAY I VTKGQLLCPT 

CSKSKRS 


4657 


830 


2126 


LLTCQFJ^GDCLLSASTMKEVVYWSPKKVADWIiLENAMPEYCEP 
I*EH FTGQDLINLTOEDFKKPPLCRVS SDNGQPJjLDM IETLKMEH 
HLBAHKNGHANGHLNIGVDI PTPDGS PS IKI KPNGMPNGYRKEM 
I KI PWPBLKRSQYPMEI^ICTFIiAFIj YALS CFVLTTVM I SWHER 
VPPKEVQP PLPDTFFDHFNRVQWAFS I CEINGM ILVGLWLIQWI* 
LLKYKSI ISRRFFCIVGTLYLYRCITMYVTTLPVPGMHFNCSPK 
liFGDWEAQLRRIMKLIAGGGLS ITGSHNMCGDYLYSGHTVMLTL 
TYIiFIKEYSPRRLWWYHWI C^IiLSWGIFCILLAHDHYTVDVVV 
AYYTTTRIJ?WWYOTMANQQVI»KB^ 
NVQGI VPRS YHWPFPWPWHIiS RQVK YS RLVNDT 


6658 


35 


855 


HCCALGAPGS P YRGLYFSSAAPCTAPRKAKHQSTLEGI»TKRMIiM 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C*>Cysteine, EfeAspartic Acid, E« 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine # KaLysine, 
L ^Leucine , M=Me thionine , N^Asparag i ne , 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine r 
Wr^Tryp t ophan , Y^Tyrosine, X=Unknown, *=»Stop 
Codon, /—possible nucleotide deletion, 
\»possible nucleotide insertion) 








FDPVPVKQKAMDPVSVSYPSNYMBSMKPNKYGVIYSTPLPEKFF 
QTPBGIiSHGIQMBWDLTVNKRSSPPSAGKSPSSLKFPSSHRRA 
SPGLSMPSSSPPIKJCySPPSPGVQPPCTPL^PVMAAALSBHG 
IRSPGILPVIQPVWQPVPFMYTSHI^PLMVSLSERMENSSSS 
MQVPVIBSYEKPISQKKIICIEPGIEPQRTDYYPEEMSPPLMNSV 
SPPQAIMB 


6659 


18 


523 


EPQRGIXSTWFQNCSLPKFVCFFC^FWI^RAHSMSNiaSIiPGL 
RGLTS I SRNQLQCTTCAMRVI NNYQPJiWKNQWr^ 
CX?NPTiTCPHNRTIimOT 

AQTPAtMFYIVACDNRnQRRDPPQYPWPVHLHTI 1 


6660 


514 


1707 


CAASbDCRHHU^PDMK^^ 

LPLLLEQFHKHSQSSQRRTILEMLLGFLKLQQKWSYEDKDQRPL 

NGFKDQLCSLVFMALTDPSTQLODVa IRTLTVliGAQPDLLSYED 

LF^VGHLYRI>SFLKED3QSCRVAAXiEASGTLAALYPVAFSSHL 

VPKI^ELRVGF~ f 3NLTNGDEPTQCSRHT.Cav3AI>SAVSTIIPSIV 

KBTLPLLIiQHLWQVNRGNMVAQS SDVI AVCQSIiRQMARKCQQDP 

ES CW YFHQTAI PCLLALAVGASMPE KEPS VI>RKVI^EDEVLAAM 

VSVIGTOTTHLSPBIJUVQSVTHIVPLFLDGNVSFLPENSFPSRF 

QPFQJDGSSGQRRIiIALIJ4AFVCSLPRNV^ 

PG 


6661 


179 


430 


GVHAASGTI^ATWIAEAKMFDSLAKAGKYLGQAAKI^ 
NYVEHMRVNHPDQTPMTYEBFFRERQDAKYGGKGGARCC 


6662 


165 


423 


R6LPKPAPAQPASIHCARFKGVTPPTAKTAMSDGNTAFNALMYC 
GPKADDGNI FSACAPASSAVKASVSVAQPGOAVIP 


6663 


3 


1005 


RPVI£SRVDDFVPPIJ?BTSGRRKKLERMYSVDRVSDDI PIRTWF 
PKKm,FSFQTASrrMC^eNFRKHLRMVGSRRVKAQTFAERRER 
SFSRSWSDPTPMKADT3HD3RDSSDLOS3HCTLDEAFEDliDWDT 
EKGIiEAVACDTEGFVP PKVMDI S S KVP KAE Y I PTI IRRDDPSII 
PILYDHEHATFKD ILEE I ERKLNVYH KGAKI WKMIUI FCQGGPGH 
IiYLLKNKVATFAKVEKEEDM IHFWKRLSRLMS KVNPEPNVTHIM 
GCYII^PNGEKLFQNIJiri^TPYRVTFF>SPLEIiSAQGKQMIET 
YFDFRLYRLWKSRQHSKLI*DFDDVI» 


6664 


58 


968 


PRIJJtt^RSVVVMDSPWDEIAIAFSRTSMPPFra 

LANHmiLLASSIWYITFFCPHDLVSQGYSYLPVQIJ^ 
TRT^ncrVGGVTHANSYYKNGWrVMIAIGWARGJ^TI ITNFERL 
VKGDWKPEGDEWLKMS YPAKVTLLG 3 VI FTFQHTQHLAISKHNI* 
MF-LYTI FIVATKITTWTTQTSTMTFAPFEDTLSWMLFGWQQPFS 
SOTKK^FJU<^PSNGVGSIJVSKPVDV7^DNVKKKJrrKKNF 


6665 


171 


1278 


DERRIACRQWTQQRS ELY PGFXJKRQRFLP KAGEFJUUIOGGRHL 
PGRWLGPGCTG^PCSVOTATGPEPRKLPLLPPDSPIISGYPKEPA 
ALCPGI PSPCRMTHQDIiS ITAKLIITC3GMAGLVGVTCVFPIDIAK 
TRIiQNQIIGKAMYKGMIDCU^UCTARAFXSFFG^ 
BKAI KLAANDFFRRJLLMEDGMQRNLK^ 

PMKMLKIQLQDAGRLAVHHQGSASAPSTSRSYTTGSASTHRRPS 
ATLI A WELLRTOGLAGLYRGLGJ^TLLRDIPFS 1 1 YFPLPANLNN 

IjGFNELAGKAS fahs fvsgcvags iaavavtpldvlktriqtlk 

KGLGEDMYSGITDCAR 


6666 


498 


2868 


MTTFLPVPOMMAGFSFGTFGNPPMESPSAWQTIHQPFIVSCLTL 
WSPGCWPQPIQKEGVGLWD IRKPQSSLLRYGGMLSLOSAMSVRF 
MSNGTQLIAIJUiRLPPVLYDmSRLPVFQFDNQVYFNSCTMKSC 
CFAGDRDQY ILSGSDDPI^YMWRI PADPEAGGIGRVVNGAFMVL 
KGHRS I VNQVRFNPHTYM I CSSGVEKI 1 KI WS P YKQPGCTGDLD 
GRIEDDSRCXYTHEEYISLVIJTSGSGI^HDYANO^VQEDPRMHA 
FFDSLVRREIBGWSSDSDSDLSESTILQLHAGVSERSGYTDSES 
SASLPRSPPPTVDESADNAFKIjGPLRVTTTNTVASTPPTPTCED 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
lA^ftlanine, C=Cyeteine, D=Aspartic Acid, B» 
Glutamic Acid, P^Phenylalanine, G=Glycine, 
H*»Hiatidine, I=Isoleucine, ^Lysine, 
L^Leucine , M=Me thionine , N~Asparagine , 
Pr= Proline, Q^Glut amine , R^Arginine, 
S-Serine, TVTitreonine, V=Valine, 
WsTryptophan, YaTyrosine, X-Unknown, *-Stop 
Co don, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








AASRQQRL3AI^yQDKRLIAI>SNBSDSEENVCEVEIiDTDLPPR 
PRSPSPHDESS3SSSSSSSEDBKKOT3RRASTHQRNAMRRRQKT 
TREDKPSAP I KPTNTY IGEDNYDYPQIKVDDLSSSPTSSPERST 
ST0EIQPSRASPTS1)IESVERKXYKAYKWLRYSXTSYSNNKDGE 
TSIiVTGEADEGRAGTSHKDNPAPS S SKEACLNI AMAQRNQDIjP P 
EGCSKDTPKEETPP^PSKGPGHEHSSHAWABVPEGTSQDTGNSG 
SVEHPPETKKI^KAI^SRAEEPPSPPVPKASGSTIWSGSQJCP 
RTOSDDSEEP3LETIC^HNNGIUiHPPa>PHPHlWGQNI^El£W 
AYSSPGHSDTDRDNSSLTGTIJiHKIXXGSBMACBTPNAGTRKDP 
TOTPATI^SRAVHGHSGLKPJQRIELHDTDSENSSGKKKLKT 


6667 


171 


1310 


AEEVBRLAAMRSDSLVPGTHTPPIRRRSKFANIjGRIPKPWKWRK 
KKSKKFKHTSAALERKISMRQSREEIiIIO^VIJCEIYDKDGSLSI 
SNEEDSI^G0SI^SSQLSIJ»ALSE>ffiPVPMPRDPC3YEVLQPS 
DlHDGP0PGAPVKLPCI^VKLSPPLPPiaCVMIC«PVGGPDLSI.V 
SYTAQKE<XK5GVAQUHHT\rLPSQIQHQLQYGSUGQHLP3TTGSL 
PMHPSGOlMIDBIifcnCTI*AinM^ 

GDGVTKAGP^LPEIRQVPTVVIECPrmENVPHESDYEDSSCL 
YTRBEEEEEBDEDDDSSLYTSSLAMKVCRKDSIiAIKPSNRPSKR 
ELEEKNILPRQTDEERLELRQQIGTKL 


666B 


714 


358 


TLAVATGPALTLRC3IVCTS S SNCKHS WCPAS SRFCKTTNTVEP 
LRGNT , VKKDCAE SCTPSYT LQGQVG S G TCSTQ CUQEDL CNE KT.H 
NAAPTRTAIJ^SALSLGLALSLLAVILAPSIj 


6669 


459 


1207 


KDEETRKDYDYMLDHPEEYYSirY YKTYSRRLAPKVDVRVVI LVS 
VCAISVPQFYSWWNSYNKAISYLAWPKYRIQATEIAKQQGLLK 
KAKBKGIOaKKSKEEIRDBEENIIKNIIKSKIOllCGGYQKPQICD 
LLLFQI ILAPFHLCSYIVWYCRWIYNFNIKGKEYGEEERLYI IR 
KSMK^KSQFDSI.EDHQKETFLKREIiWI KENYEVYKQEQEEBLK 
KKLANDPRWKRYRRWMKNEGPGRLTFVDD 


6670 


1S4 


594 


VARI*GEAAKMS3EPPPPYPGGPTAPIiLEEKSGAPPTPGRSSPA 
VMQPPPGMPLPPADIGPPPYEPPGHPMPQPGFI PPHWSADGTYM 
PPGFYPPP<3PHPPMGYYPPGPYTPGPYPGPGGHTATVkVPSGAA 
TTVTV 


6671 


1 


763 


LPAKKPRSAPNMAGGRaSPQLTAIiLAAWIAAVAATAGPEEAALP 
PEQSRVQPMTAS1WTLVMEGEHMI JCF YAP WC7PS CQQTDSE WEAF 
AKNGEII^ISVGKVDVTQEPGLSGP^PVTTLPAFFHAKDGIFRR 
YRGPGIFEDLQ^ILEKKHQSVEPLTGWKSPASI/TMSGMAGIjFS 
ISGOWHUINYFTVTI^IPAHCSYVFFVIATLVFGI>SMDLVL*V 
ISQOJWDPPYRHVS * /RPSTNLGVHTAHTSKHLRL 


6672 


304 


1089 


APGSKPVQFMDFEGKTSFGMSVFNLSNAIMGSGILGIiAYAKAHT 
GVI FFIiAIjLIrCIALLS S YS rKOjIJYrCAGIAG I RAYEQLGQRAFG 
PAGKVVVATVTCH^Kl^VGAMSSYLFIXKSEIiPLVIGTFLYMDPEG 
DWFLKGNlil IIVSVLIII.PLAI^KHIXSYIiGYTSGIiSLTXmFF 
I.VQVTYKin^Tirsr^YHAThlKOOWESEALVOTPOPRDSTAAVKAO 
MFJ^*LTGVLTQWPIMAFAFVCBPGGAGPSITELC^AFQAQD 


6673 


1116 


1963 


LQIQTHHTHHGARVTK1jGSIK}IJ^NAGTMIX^Q0SSSMAPAFSQ 
SVTCGPSPCVRKQESATKCIJHIGACGSDLWARGWEQG*G*GIJ^V 
WLC PCVA FHRGARPQAEEGGARWNSIjVS S PS IPPNP *HSS 1GAE 
NAVPRP*QG*KVNPSGQERQS \WVLPI»PVPGEPLKLPGLPG+NK 
SFSRV/SGSKGKWILPRQIM*A3*R\TPRPVPGTQWVPITW/PL 
ITWH*SAPTPPLKACPAPRESDPCSSCLSCPCVTQHPRFSDTGW 
FG AGHCHS S CDFTRKGAAGGPG 


6674 


1 


440 


LEFDYMCQYDYVEVRDGDNRDGQI IKRVCGNERPAPIQSIGSSL 
HVLFHSDGSKNFIXSFHAIYEEITACSSSPCP 
KCACU^TGQRCF^IJ^ERNCSDPG/WPSQWVPBNNRGPWAYQ 
PTPC* IGTRVAFFLT 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D^Aspartic Acid, E=» 
Glutamic Acid, F=Phenylalanine, G-Glycine, 
H-Histidine, I=Isoleucine, K=Lysine. 
I>= Leucine , M=Methionine, N-Asparagine , 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=»Valine, 
W=Tryptophan, Y»Tyrosine, X=0hknown, *=Stop 

LOQOI1, / "pOBBXOlc IlUCl CO L- XvXti (UUctlUUi 

\=*p03sible nucleotide insertion) 


6675 


277 


1678 


GNWPTERMAFLDNPTI IIAHIRQSHVTSDDTGMCEMVLIDHDVD 

LBKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 

RJ^mMRLERLRKERQNQrKCKNIQWKEfiNSKQSAQKLKSLPB 

KKSLKEKPPISGKQSH^VRI^BQCPIflUCTFl^ 

GTOATBaCIDVYLPLHSSQDMXPMTVVTMT^ARVQDI.IGLlC^ 

YTSBGREPKLNDNV^YCI^I ABDWKVDTOPPPLDSNEP IHKF 

GFSTLALVBKYSSPGLTSKBSLPVRlNAAfiGFSbrQTOWTKVTM 

KEII*LKAVKR31KGSQKVSGSRAIXJVTEEDSQIDIATVQDMI>SSH 

HYKSFKVSM IHRI*RFTT1IVQI»/ GCAI»FI<3VIiRiaiAAPVDCLRPS 

ADTWRQBQIGCCGAACAALRS * , DSHKC*EGISGDKVKIDPVTNQ 

KASTKFWIKQKPISIDSDUXaC\DLAEE 


667<J 


277 


1678 


GNWPTERMAF1J^PTIII*AHIRQ£HVTSDDTG^ 

LBKIHPPSMPGD3GSBIQGSNGBTQGYVYAQSVDITSSWDFGIR 

RRSNTAQRIiBRIiRKERQNQrKCKNIQWKERNSKQSAQBLKSLFE 

KKSI>KEKPPISGKQSII»SVRLEGGPLQLNNPF^ 

GTTATKKIDVYIiPIiHS S QDRIiLPMTVVTMAS ARVQDLIGLI CWQ 

YTSKGREPKI>WDNVSAYa^ABDIX?SVDTDFPPLDSNB PIHKF 

GFSTIJUi^KY^PGLTSKHSLFVRrNAAHGFSLIQVDNTKVTM 

KEII»LKAVKRRKGSQK^SGSRAIX^EEDSQIDTATVQDMLS3H 

HYRSFKVSMIHRLRFTn)VQL/GCAI>FPGVIfRKPJ^APVDCLRP3 

ADTWRQEX3IGCO^CAALRS*DSHKC*EGISGDKVBlDPVTNQ 

KASTKFWIKQKPISIDSDLLCAC\D1ABB 


6677 


277 


1678 


GNWPTERMAFLDNPT 1 I 1 AHIRQSHVTS0DTGMCEMVLIDHDVD 
LEKIHPPSMPGDSGSBIQGSNGBTQGYVYAQSVDITSSWDFGIR 
RRSNTAQRLERLRKKRQNQ I KCKN I QWKERNS KQS AQH IiKSL FE 
KKSLKEKPPISGKQSILSVRX^CPI»QLNNPFNBYSKFIX3KGHV 
GTTATraCIDVYX,PLHSSQDRLLPMTVVTWXSARVQDI»IGI*IC^ 
YTSKGREPKLNDWSAYCmiABDlX3EVDTDFPPIiDSNBP IHKF 
GFSTIiALVBKYSS PGLTSKESIiFVRINAAIK5FSLIQVDMTKVTM 
KBII^KAVKRRKGSQKVSGSFJUWFBEDSQIDIATVQPMLSSH 
HYKS FKVSMIHRLRFTTDVQL/GCALFPGVIJIKRAAPVIX!1>RPS 
ADTWRQEQIGCCXjAACAAIjRS *DSHJtC*EGIbGDRVIalDF Vi MQ 
KASTKFWlKQKPISIDSDLLCACXDIiABB 


667B 


221 


"865 


GPSNQSSGSt^LIVTGCSSYWS*IND'rCTILRVLSSNFGRQ*LR 
PFPCSQIJPMS^LWHIJXXCTWVPxTP^ 
LLGSDQBSVGLBDLCVFVNFLIBVIfctiGIiFP* PHEI*PI*LP WDLG 

flfplllgggctclvlpanlvsoapqigklscrj^ 
hhplfiiwgrwiavkhletvqsgiiaslgfvgqhtshgpp 


6679 


2 


786 


IiEFARGAMPFLGQDWRS PGO^WVKTVDGWKRFLDBKSGSFVSDL 
SSYCHKF^TYNKENLFNSIiNYD/ SCSQBEW3GHAE*QNfGNS \DPH 
QEKWIYVHKGSTKERHGYCTIjGRAFI^^ 
RLLELIAKSQLTSI^IAQKNFMNII^KVVli^^ 
FJJiQTLYTSIXrTLVKRVGKSVLVG^ LHWQQQLN 
OTQITRVBGO^QPPPGSGSLHRDTOQTRQDFBPTPVTEESGIiF 


66B0 


■ ■ 1498 


2951 


PLCTLPL.VJPS ALPGWAGKRWEKQWPIA/ PGPGTWQTPVGS ISKS 

p\rknbpothcprgbarpb^*hlpkphspgsbgakiqtsa*alp 

/NQVSPPQPM*GAEENGDORGGKEEAGEELHRSSSGLTAAPGF? 
EVKRNI^FPGLPSRGGGP/GGAGTOGSWAPGKQPP/SPLLPAS 
KQRS QAGLPG WEAGLVBS PT1IHIPALRPSGTNATGBAFPSTTC6 
SGP\PAPPGPTGIiRPGGGSSSGGHG* * PGLPVGKV\GALGAAQD 
PQSQGRGPTG^TVGTBMLIkSGLGSAKACPAARPAVP *LPSDPAS 
TIPKKGTRGFGEGPGVLQERNRWWGRAQGFTSADAAGTAPPGV 
* LPAPIiSQPPGATEPQVRACGMAPPS PGTSGRLVAWGRHPGPQV 
AQGCPPGAGCWGSQPRGSQRCPRTYTHSPLGHGRAPCPRRCWH* 
WQDP PSSpRTGCLPGl PARQAYSAPRTRSRPG IRTGRAAYGFIR 
FQGGGGG 
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SEQ 
ID 

NO; 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A*= Alanine, (^Cysteine, D=Aspartic Acid/ E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H^Histidine, I=lsoleucine, K-Lysine, 
L^Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V«Valine, 
WeTryptophan, YwTyrosine, X=Un3oaown, *~Stop 
Codon, /^possible nucleotide deletion, 
\wpossible nucleotide insertion) 


6681 


1169 


511 


INyiYYWQQQRAFHBLKXKKIiMSAPAIjGliPDLTKIjFTLHVSKRE 
KMTVGVLTQTVGPWSRPGAYLS KQLDGVSKGWPPCPRALAATAL 
IiAQEADKl»TLRQNnbNRKSPHA\VVTLINTRGHH*LINARLTRYQ 
TLLC^KNPHKTI EVSNT/IJ^PATLLLVTBS PVKHNCLBVLDS VYS 
SRP1H J RDHP*TSVDWKLYVDGSGFANPCKVTLKKETSPAPVTPR 
S 


6682 


109 


1238 


TVLC<^QVSSIiNEVKIYSI^C«KSI*PEI^ 
VRRRI^IQDFEMPTVCTTIKVSIOXKJYILATXSrYTCPRW 
YQI^LKFERCLDSEVVTPE II^DDYSKIVFLHHDRYIEFHSQSG 
FYYKTRIPKJTORDFSYHYPSCDLYWGASSEVYRUJr^QGRYI^ 
PLQTOAAKNWCD INS VHGItPA 

D\AP*WSQQIQR*TSI,PTISALKFN\GALTMAVGTTTQQVIJiY 
DLRSDKPIJ^VKDHQYGIiPIKSVHFQDSI^LIIaSADSRIVKMWNK 
NSGKIFTSLEpSHDLNDVCLYPWSGMLLiaNETPKMGIYYT PVL 
GPAPRWCSFLDNLTKELEKNPBSWB 


6663 


109 


1238 


TVItOGAMQVSSLNEVKI YSIjSCGKSLPBWLSDRKKRALQKKDVD 
VRPJ^IBLIQDFEMPTVCTTIKVSKIX^YIIATGTYKPRWCYDT 
YQI»SLKPERCI^BVVTFKir^DDYSKIVFLHNDRYIKFHSQSG 
FYYKTR I PKFGRDFS YHYPSCI>LYFVGASSKVYRLNIjEO^R YtiN 
PLQTDAAENNVCDrN3VHGL»FATGTIKGRVECWDPRTRNRVGLL 
D\AP* TVSQQ I QR * TSLPTI SAI»KFW\GAIiTMAVGTTTGQVLIjY 
DLRSDKPLIiVKBRQYGbP I KSVHFODSLDLILS ADSR1 V KMWNK 
NSGKI FTSLB PEHDIJTOVCL YPNSGMLLTAKETPKMG I YY I PVL 
GPAPRWCSFLDNI.TKRLEKMPKSNE 


6684 


111 


527 


GLRGGT SRGRAGRE PRFAAGVLCVVAGPCQS PCPPGGRGRKAPA 
PP\SGRRHA*RPA*WLGGPGGDSGGREEGGS /GELQRAMESKMG 
ELPLDINIQEPRWDO^TFI^RAPJIPFTVTDPPJIiaJ^GAQI^AS 
RNIVQNYR 


6685 


258 1 


1473 


KLLGDNFEGFCNKFELSDSSNGSNS *QSPL\FDRJbFDPDPQKVL 
QGVI DMKN^VIGNNKQKANI* I VljGAVPRIXyiXQQKTSS TELKT 
ECAVVU^IAMGTENNVKSLLDCHI I PAU^QGLLSPDLKF IEAC 
LRCLRTIF^SPVTPEEIiYT'DATVI PHLMALIiSRSRYTQBYICQ 
IFSHCC^PDHCrril/FmiGAVp^IAHLLTSl^YTWRMQALKCFS 
VLAPENPQVSMTLVim»VI>GBIJiPQI FVKMLQRDKPIEMQLTSA 
KCLTYMCRAGAIRTDDNC i VIiKTLPCIiVR^KERLIiERRVEGA 
BTIAYI>IBPDVBI^RXASITDHiaJUvll^YFKYPSSV5AITDIK 
RIEHDLKHAHBI*QAAF10.YASU^^ 
XASRQGVTST 


6686 


310 


927 


l^VTFDDIJWDFTPKEWTLLDPTQRNLYRDW 
QLPKPSLISWI^QEESRTVQRGDFQASEWKVQLKTKEIjALQQDV 
LGEPTSSGIQM IGSHNGGE VSDVKQCG^VSSBHS CLKTHVRTQN 
SBNTFE CYLYGVD FliTLHKKTSTGEQRS VFSHVWKKPS SLNPDV 
VCQKHRCTRKKKAF* LQI/TLGKSPH* S IHT 


6687 


iai 


915 


EAMIJBAPYKKEEDEQQRKEVKKDYPSNTTSSTSNSGNBTSGSST 
IGETSNRSRDRDRYRRPJSTSRSRSPGRQCRHRSRSWDRRHGSESR 
SRDHRREDRVHYRS PPLATGEPVDNLS PEBRDARTVFCMQIAAR 
IRPRDLEDFFSAVGKVRDVRI isdrnsrrskgiayvefceiqsv 

PLAIGLTGQRLLGVP I IVQAS QABKNRLAAMANNLQKGNGGPMR 
IiYVGSLHFNTTEDMLRGIFEPFGKV 


6688 


1025 


X 


AEVPOTPRVFHKCPDSCWRFKFQPIQI^PYIIiSFSSE10?PISF 

SEPGLPR/SATARMATAAAPPNSSIDLPSDSGMGFIS PAGDSLD 

LPSDGGTGFFSLAGDSSSTRIiSSLAFISFSLSSVSVGSSAGTTS 

STSVGSVVAAPTSSSSSSTCraDVAGLDPSTVITSVSGSliVPS 

YAVICGSKGAGASGSASCSSRAGKTTEATAASSMPSGTSSFSTC 

TMSKLEELFSIUPSPAPLLSKLFTSSGS IAICCQDSGPSDTGRLS 

VCQliS*LAD3inX3KIiSIX3QEVVTVGDS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 

\*\J-L JL CDUUilUlliU 

to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A-Alanine, (^Cysteine, EteAspartic Acid, E- 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H»Histidine, I=lsoleucine, K-Lysine, 
I*=sLeucine, M=Methionine, N-Asparagine, 
P^Proline,' 0=Glutamine, R=Arginine, 
S=Serine / T=Threonine, V-Valine, 
W ^Tryptophan, Y^Tyrosine, X=Unknown, +=Stop 
Cod on, /^possible nucleotide deletion, 
\wpossible nucleotide insertion) 








SSAVIPGYSSSSDSRLNTVPTVDLLCPFQTKS5T 


6689 


640 


1299 


SSSASYATSATSISDTAFSGSLKLKHGLLSALDSSSRTS*STSS 
ABDSTFRICSPSVSDTSSDSSGSKDNVL1LFSKVSI *SCFSLSS 
FFSDSISPCFSSSSFC101*FVSS1CV«QMAI»I£SRLSNGPG(3SSK 
QRNS LTARQLAMSIj * ATKF * RNACNPNCLSSKKSAL* LSLNQR F 
GGSASRKPGNISPWSQKCSALS YCXWFVI KPRBVSVSSBNYPAF 


6690 


1 


442 


GTRGKMAATI^PI^WOJWRRCIiSARira^ 
QQVGAGQTFKYLKREHSLSKPYQGVGTGSSSLWMIM3NAMVMTQ 
YlRJLTPDMQSKQGALWNR\m^LRDWHU?VH^ 
GDGLAIWYTKDRMQP 


6691 


287 


1401 


LKTETS EEKARR YKDR PSQLNAVFQEQKKM IQAQBS I TIjED VAV 
DFTWEEWQLIX3AAQKDLYRDVMLBNYSNLVAVGY0ASKPDALFK 
LBQGEQLWTI BDGIHSGACSDIWKVDHVLBRLQSESLVNRRKPC 
HBHDAFEN1VHCSKSQFLLGQNHDIFDLRGKSLKSNLTLVNQSK 
GYEI KNSVEFTGNGDS FLHANHBRLHTAIKFPASQKLISTKSQP 
ISPK1IQKTRKLKKHHVCSECX5KAFIKKSWLTDHQVMHTGEKPHR 
CSLCRKAFSRKFMLTEHQRTHTGEKPYBCPECGKAFIJCKSRIJjn: 
HQKTHTGEKPYIC^BCGKGFIQKGNLIVHQRIHTGEKPYICNEC 
/GKGFIQKTCLIAIIQRPHTER 


6692 


178 


939 


WIKBGKLSLWERFCANI IKAGPMPKHIAFIMDGNRRYAKKCQVE 

RQEGHSQGFNKLAETIiRWCIjNLGILBV^ 

IXSLMDIiARQKFSRLMKEKEKXjQKHGVCIRV^ 

IAQAVQATKNYNKCFl^CFAYTSRHEISNAVREMAWGVKQGLL 

DPSDISESIJJ)KC^YmiSPHPDILIRTSGBVRLSDFIJ,WQTSH 

SGLVFQPVLWPEYTFWNLFEAILQFQMNHSVLQK 


6693 


178 


939 


wikbgelslwerfcani ikagpmpkhiafimdgnrryakkcqve 
rqegiisqgpnkij^lrwclnix;ilevtvyafsienfkrs^ 

IXSLMDIiARQKFSPJ^JEEKEKI^KHGVCIRVLGDIJnjbPiDLQEL 
IAQAVOATKNYNKCFLNVCFAYTC 

DPSDISESIiLDKCLYTTmSPHPDILIRTSGEVRLSDFIJLWQTSH 
SCLVFQPVLWPEYlTWNLFEAIIiQFQMNHSVLQK 


6694 


292 


813 


SliLLHLAPPGAYTPSQPLSSVSTETASSVRRQAAESRQHELPVR 
EVHSLGQII>PQDGLTAEAGPPBAQDPWGSPGISLPAAHIGFAAA 
IAVGPSGCHTEP\FDEVWPSLPIiGDAYAARDKSKLlQIiGITHVV 
NAAAGKFQVDTGAKFYRGMS LEYYG I EADDNPFFDLS VYFL P 


6695 


292 


B13 


SIiLI^KJiAPPGAYTPSQPLSSVSTETASSVRROJUVESROiiELPVR 
EVHS LGQ I LPQDGLTAEAGP PEAQDP WGSPGI SLPAAH IGFAAA 
LAVGPSGan^P\FDBVWPSLFXGDAYAAPJ)KSKLIQI^ITHVV 
NAAAGKFQVDTGAKFYRGMSIiEYYGIBADDNPFFDLSVYFIjP 


6696" 


1 


782 


FKVKbKVtjCKWAlfJj^VPAAMS^fiMEPlilJjAWSYFRRRKFQLCAD 
1»U J-WPJIjCI^oF J UVAAW X UKf\KI\L} 1 EM V 1 IDE 1 DVDQBGIAEMMIj 

DENAIAQVPRPGTSIJajPGTNQTGGPSQAVRPITQAGRPlTGFL 
R PSTQSGRPGTMEOAIRTPRTAYTAR P I TSSSGRFVRLGTASML 
TSPDGPFIM*SRimrKYSQKPKLAKALIEYI FHHENDVKTALD 
1AALSTEHSQYKD»WWK/DQIBKCYYRVGMYREAEKQIKSS 


"1*97 


3 


782 


PPLFTiRRLNSRAIiRPGSRXVMAVVPASIiSGQDVGS FAYTiTI KDR 
IPQII>TKVIim»RRHTCSBFFEKHGEEGVEAEKKAISU^KLRNE 
LQTDK^FIPLVEKFVDTDIWNQYIiEYQQSUjNESDGKSRWFYSP 

wllv\ecymyrriheai \ iqs ppu3yfdvfkeskeqnfyg2qes 
iialcthlcxjlirtiedld\bi^ucdeffkli^islwgbisvdl 
slXsggesssontwvij^sledlkpfillndmeiilwsllsnck 


6698 


(J6B 


754 


vgscacagscjcckeckctsckksecrafp 


6699 


325 


492 


EGELP/PARRVIiPRAMTASAQPRGRRPGVGVGVVVTSCKHPRCV 
LI/5KRKGSVGAGSFQLPGGK^jEFGKTWEECAQHKTWKEAALHLK 
NVHFASVVNSFIBKENYHY^IIJ4KGBVDVTBT)SRPKNVEPEKN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=*Alanine, C= Cysteine, D=Aspartic Acid, E= 

Glutamic* Acid FnPhenvl »1 An4 rip CI — Olvr* - * tif» 

H=Histidine, I=Isoleucine , K«I»ysine. 
L=Leucine, M=Methionine, NT=Asparagine , 
P= Proline, Q=Glutamine , R=Arginine, 
S=Serine, T^Threonine, V= Valine, 
W-Tryptophan, Y=Tyrosine, X=Unknown, *«Stop 
Codon, /=poseible nucleotide deletion, 
\=possible nucleotide insertion) 








ESKRI I YNHAFFFQBSKWSGGILQ 


6700 


1098 


1392 


TQCWRS STPGMRTHFRTQP /RLECGQGFSQQEKGHCMDTNECIQ 
FPFVCPRDKPVCVNTY6SYRCRTNKKCSRGYBPNKDGTACVERT 
LLLGLCNLLGK 


6701 


2 


1485 


AAAGPRTRVRRAAAFEGQPS PSPGLGPTSDKAAAPRTPKRRRLW 
RQRQ/HPAMLC^nTTRPDAVI*^ 

IBWYFGI^FTGSKQESLWLNLRNRISQQMDGLAPYRLKLRVKF 
FVRPHI*! LQEQTRHIFFLH I KKAIiIiAGHLLCS PEQAVELSAU A 
QT KFGD YNQNTAKY N Y EE LCAKKJLS S ATLN5 I VAKUKE L.EGTS Q 
ASAEYQVLQIVSAMENYGIBWHSVRDSEGQKLlilGVGPKGISIC 
KDD FS PINR IAYP WQ^TQSGKNVYLTVTKESGNS I VLLFKM I 
STRAAS GEYRA I TETHAPYR CDTVTS AVMMQ YS RDLKGI ELAS L F 
LNKNINlGKKyvPD I KRTS KEVYDHARRALYNAGVVDLVSRNNQ 
SPSHSPLKSSESSMNCSSCEGLSCQQTRVLQEKLRKLKBAMIiCM 
VCCBBE IHSTFCPCGHTVCCESCAAQI^VGKSAAHFCLQPHLSL 
LLTGSRSQVLAR 


6702 


397 


1971 


PLAFCFLKLDLVNVI»CLPMEDVFlJ?ro 

RAEAMiCSRKATVVRDLVAVRMAEEQEFTQW^PAQPSHPHW 

NNa-YKSAOHSOALLRGLLALRDCGILFDWr.WEGRHIEAHRIL 

IJ^CDYFK£^FAG6LKEMEQEBVLIHGV3YNAMCQItOTIY^ 

ELELSL3NVQETLVAACQLQI PE I IHFCCBFLMS WVDEENI LDV 

YRLABLFDl^RLTEQIJDTYILKNFVAFSRTDKYRQLPLEKVYSL 

LSSNRLEVSCETF^YKGALLYHYSLEQVQADQISLHEPPKLLET 

VRFPI^EAEX^QRIJIDKIiDPSPl^DTVASArirWiRNESI^PSLQ 

SPQTELRSDFQCWGFGGIHSTPS\MSSATRPKYIiNPLLGEWKH 

FTASIAPR^NQGIAVLNNFVYLIGGDNNVQGFRAESRCWRYD 

RHNRWFQ IQSIjQQEHADLS VCWGR Y I YAVAGRDYHNDIiNAVER 

YDPATNSWAYVAPLKREVYAHAGATLEGKMYITCGRKGRIT 


6703 


45 


1244 


GVGPRAAAMPEiELEIiCPGRWVGGQHPCFIIAKIGQNHQGDLDVA 
KRMIRMAKECXaAjKAKFQKSELEFKFNRKALERPYTSKHSWGCT 
YGjaHKRHLEFSHDQ YREIiQRYAEEVG I FFTASGMDEMAVEFLHE 
LNVPFFKVGSGDTNNFPYLEKTAK/TRGWHSVt^VCGVQLNDE 
TSSWDVIGRVRTSKEKVUWLVI^YSGRPMVISSGMQSM^ 
VYQ I VKPLKPNFCFLQCTSAYPIjQPEDVNLRVISEYQKLFPDI P 
IGYSGHETGIAJ^VAAVALGAK\niiERHITIiDKTHKGSDH^ASl£ 
PGEEABLVRSVRIjVERAIjGS PTKQLIJ?CEMACiraKLGKS\Af AKV 
KI PEGTILTMDMLTVKVGBPKGYPPEDIFNLVGKKVLVTVEEDD 
TIMBE 


6704 


B2 


1007 


TMNTRX^VVNSGLGASPASRPTRDPQDPSGRQGBI>SPVEDQREG 
LEAAPKGPSRESVVHAGQRRTSAYTLIAPNINRRNBIQRIABQE 

YKQKLKREESVRIKKBABEAEIK2KHKAJCQRBKSNKLEBKKRI4)B 
NLRREAFREHQQYKTAEFL/RQTFaiRIARQKCLSKCCLWPTILN 
MGQKIiGLQ \ DSJUKAEENRKLQKM KDEQHQKS EI>I*EEKRQQQEQK 
RAKIHQTEHRRVNNAFLDRLC^KSQPGGLEOSGGCl^NMWSGNSW 
GI 


6705 


2 


786 


rlcrnsarvpckwsasrslgegagfigplrgphpraggtgtsft 

SYKRKGGIMSTIAAFYGGKSILITVATGFLGKELMEKLFRTSPD 
LKVIYTIiVRPKAGQTLQHRVFQlIiDSKIiFEKVI E VR PNVHEKI R 
AI YADLNQNDFAISKEDI^EIiI^CTTTC IF^ 
VQIiNVlTATRQIiLLMASOMPKLEAFIHISTAYSNCNLIOIIDEVIY 
PCPVEPKKI IDSLEW\LDDAIIDEITPKLIRDWPNIYTYTK 


6706 


130 


531 


FTHSS SSR^QEMIX3KLNMLRNIX5H FCD I T I RVQDKI FRAHKWL 
AACSDFF11TKI>VG0^EDENKNVIJ)LHHVTVTGF I PLLE YAYTAT 
LSINTBNI IDVIAAASYMQMFSVASTCSEFMKS3XLWNTPNSQP 
EK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A» Alanine, C= Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, P«»Phcnyl alanine, G=Glycine, 
H=Hiatidine, I=Isoleucine, K~Lysine, 
I»= Leucine , M=Me t h ionine , N= Aspa ragine , 
P=3proline, Q^Glutamine, R=Arginine, 
S=Serine, T=*Threonine r V^Valine, 
WaTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6707 


2233 


1343 


Y WSGIGWEIXJHPHWRKFHPBKKGPPSTCQBRIiYESRSRWPCIS * 

GMVWGWTAVNGSW * C5GQURCVCVCTSHSSDSTRSSQRAS KCHS 

FFIIiSQ*KT*SSWENWVFAKYSRIYSVGHSCSKGRGD*DFK*KV 

SQAR*SRFOGLCOTCGHTOLDIMLRGGSSPW^ 

NRRVFSIiLCEGPGHCYQGAVraBACAAASPGLD^ 

TD*LPK*GPGYIC»^CDSNILCILYNISPNLFSYSP*GVARYA 

C*RCHWYFBWI*LYNHCGDILVACL*RRQL*SSQ 


6708 


115 . 


1729 


TVGS WSRSGRS PPVGRQLLiLTGRGAQAAGS PQGGMALQVELVPT 
GEI IRVVHPHRPCKIALGSIXJVRVTMESALTARDRVGVQDFV 
ENFTSEAAFIENbRRRFRENLIYTYIGPVLVSVNPYRDTiQIYSR 
QHMERYRGVSFYEEPPHiLAVADT\nrRALRTERRDQAVMI SVES 
GAGKTDATKRLLQLYAETCPAPQRGGAVRDRLLQSNP^RAFGN 
AKTbR^NSSRFGKYMDVQFDFKGAPVGGKTI^YlJiEKSRVVHQ 
NHGBRNFHI FYQl^GGEEETLRRLGbERNPQSYbYbVKGQCAK 
VSS INDKSDWKVVRKALTVIDFTKDBVEDLI>S IAASVLHLGNIII 
FAANF^SNAQ VTTBKQIiKYLTRIJ>S VEGSTLRKALTHR KI IAKG 
BELLSPIjN1»EQAAYARDAIiAKAVYSRTF^ 

BSPS^n^TTVLGLLDIYGFBVFQRWSFEQFCINYCNfKKLQQr>FI 
EbTLKSEQKRYEABGIAWEPVOYFWNKIICDLVBSKFKGI I\SI 
LDE\BCLRPGE 


6709 


3 




PPHEHLFPSGERGPFSFLVSRRGLGPGKMGKKGKKBKKGRGAEK 
TAAKMBKKVSKRSRKEBRDLBALIAHFQTLDAKRTQTVEIiPCPP 
PSPRIiNASLSVHPEKDSX.ILFGGEYFNGQKTFLYNBLYVYNIRK 
DTHTKVDI PSPPPRRCAHQAVWPOGGGQr ; WVFGGEFASPNGEQ 
FYHYKDLWOHIATKTWBOVKSTGGPSCT^HRMVAWKRQIiIIiF 
GGFKESTRDYIYYOT^AFNIOTFTWSKLSPSGTGPTPRSGGQ\ 
I PSLPRAASSVYGGYSKQRVKKDVDKGTRHSDMF 


6710 


158 


980 


RHKMimRVESSSGRAARKMRLALMGPAFXAAIGYIDPGNFATN 
IQAGASFGYQXiLWVVVWANLi^LIQirjSAKIjGIATOKNIiABQI 
RDirYPRPVVWFYWVQABIIAMATDl^FlGAAIGFKLIJ^VSIJL 
QGAVLTGIATFLI LMliQRRGQKPLKKVI GGbLLFVAAAY 1 VEbl 
FSQ PNLAQIiGKGM VIPSL PTSBAVFLAAGVIj \GATIMPHVI /YT 
WHS SKTQI ILHGGS RCXJR YSATKWDVAI AMTIAGFVN LA I MATAA 
SBLNFYGHTGVA 




3 


347 


VTECKTMTCKMSQLBRNI *TMI2mflHYSVKLGHPDTI,IHGBFK 

BLVRTDUINII*MKBNKNDQ^*RIMEDU7FKAR^ 

MAMLTWSYHDNMHDADYGPGQQHRPG 


6712 


118 


57B 


PHGQKRTRYPQVRAPC^PQAQLAMAL^^ 
FEPGTQRFELYKKAQASLKSGI^riRSVVRLPPGKNIDDWIAVHV 
VDFFNRINLIYGTMABRCS *TS CPVMAGGPRYEYRWQDRRQYRR 
PAKLSAPR YMALLHDWI RSbl 


6713 


2485 


3 


QARGSDSBDGKFBIQAKDDARARKLGPGRPbPTFPTSECTSDVB 
PDTREMVRAQ^^CKKKKSGGFQSMGLS YPVFTCG1 MKXBYKV PTPI 
QRKTI PVIIJX3KDVVAMARTGSQKTACFLLPMFERLKTHSAQTG 
ARALII^PTRPIiAIOTIiKFTKEIiGKFTGLKTALIIiGGDRMEDQP 
AAIiHENPDI I lATPGRIiVHVAVBMSIJa^VBYVVFPBADSLFE 
MGFAKQLQEI lART.PGGHQTVLFSATLPKLbVBFARAGbTEPVt, 
IRLDVDTKIKEQLKTSFFLVREDTKAAVIJJ^^ 
VF^TKl^HAEYLTEIiTTQRVSCAHIYSAIJ)PTARKINIJVKFTL 
GKCSTLI VTDLAARGLDI PI JjDNVINYS FPAKGKLFbHRVGRVA 
RAGRSGTAYSLVAPDEIPYI^LHLFliGRSLTLARPLKEPSGVA 
GVIXiMbGRVPQSVVDEEDSGLOSTljBASIiBLRGIARVADNAQQQ 
YVRSRPAPSPES I KRAKEMDLVGICI»HPbFSSRFBEEBLQRIiRb 
VDS I KNYRSRAT I FEIWASSR1)I>CSQVMRAKRQKI>RKAXARFQQ 
GQC^RQEQQEGPVGPAPSRPALQEKQPKKKEEEEAGESVBDIFS 
EWGRKRQRSGPWRGAKRRRBEARQRCQRPYIPYRPKDFDSSRG 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, F= Phenyl alanine , G=Glycine, 
HsHistidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=?Asparagine, 
Pt=Proline, Ch=Glutatoine , R=Arginine, 
SsrSerine, T=Threonine, V»Valine, 
W=Tryptophan, Y=Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«po3aible nucleotide insertion) 








LSISGEGGAFEQQAAGAVIiDU^DKAQl^TRGRQQLKWDRKKKR 
FVGQSGQEDKKKI KTBSGRYI S SSYKRBLYQKN KQKQKI D* S * L 
GRRRGILTRRRPRTEEVGKAR PLAQAGCI PGPHAPRHPLQABSA 
LBLKTKQQILKQRRRAQKAALSLQRWWPQAALCPQ 


6714 


169 


1416 


l^CQKI»LPPPPAPMAHIPSGGAPAAGAAP^PQY(^CacVEI^VS 
GQNLfcDRDVTS KSDPFCVLFTENNGRWIBYDRTETAINNLNPAF 
SKKFVIflYHFBEVQKLKFALFIW^ 

TIVSSKKITRPLI^IJfl3KPAGKGLlTIAAQBr^DNRVITLSLAG 
RRIiDKKDLFGKSDPFLEFYKPGDDGKWMLVHRTEVI KYTLDPVW 
KPFTVPLVSLCIXJDMBKPIQVMCYDYDNDGGHDFrGEFQTSVSQ 
MCEARD5VP1.EFECINPKKQRKKKHYKN9GI I ILRSCKINRDYS 
P^YIIGGCQLMFTVGIDBTASNGNPLDPSSLHYXNPMGTNBYIi 
SAIWAVGQIIQDYDSDKMFPAI^PGT^QIiPPDWKVSHEFAINFNP 
TNPPCSGVDGIAQAYSACLP 


6715 


32 


493 


GPAGAESGSLHCLPATVQAI»flGAAHSPHGGGPPRRGPLIGSGMP 
GKPKHLGVPNGRMVLAVSIX5BLSSTTGPQGQGEGRGSSLSIHSL 
PSGPSSPFPTKEQPVASWAI>3FBRLLQDPIX3LAYFTBFI>KKBFS 
AEMVTPWKACBRFQQIPASDT 


6716 


1 


176 


GAGGPAPRSFGSBBPRAALBRDKMSARAAAAK3TAMBETAIWBQ 
BTVTLHRVSI>CC3K 


6717 


115 


896 


LFAMSGFENLNXDFYQTSYSIDDQSQQSYDYGGSGGPYSKQYAG 
YDYSQQGRFVPPDMMQPQQPYTGQIYQPTQAYTPASPQPFYGNN 
FBDBPPLLEEU3I>IFDHIWQKTLTVLHPLKVAIX;SIMMETO 
PMVFCIAFGATLU^KIQFG YVYGI SAIGCI>GMFCrlLNLMS^3T 
GVSFGCVASVLGYCLLPMII,LSSPAVI FSLQGMVGI ILTAGI IG 
WCSFSASKlFISALAMBGQQIiLVAYPCALLYGVFALISVF 


6718 


290 


599 


KQ3STVPGTILPSLKWHNSGI*CKFPKTGGKMTTFKEGIiTFKDVA 
VIFTEEPJXLLDPVQROTiYQDVMLKNFRNLI^VGHHPFKIIDVFIi 
LEKEKKLDIMKTATQ 


6719 


1 


691 


PTCPKKQDREIX5KCMKMHMNPISGNI^^ 

G^AHCKSSKARTGIJ^lTRRCAATAANKIIGjM^ 
RSKlTORKKPLHIiACTTAKKKLSIX^ 

DPDSBGSTKVLSQALNGDSDSBDMI^SRHKHRHTNIHKIDAPSK 
RKSSSVTSSG 


6720 


3 


B22 


HBVARFJVGGTVYPQRGTMPGTKRFgHVIF/TPEPGKWELTGYEAA 

VP I TBKSNPLTQDUDK7ADAEK IVRIJjGOCDAE I FQEBGQALS TY 

QRLYSES II>TTtWQV2V3KVQBVIjKEPDGGLWLSG 

U^V8FNQIJ4KG1WKPX*YTYLIWOTRSVVASRBG^ 

IEELKKVAAGKKRVIVIGISVGLSAPFVAGQMDCC^IKNTAVFLP 

VLVGFHPVSMARHPFPPPRILRSLTVFPSM^HYQITSLI.FSM 

SWTLISB 


6721 


3 


822 


H^VBEAGGTVYPQRGTMPGTKRFQHVIF^PEPGIGfFXTGYKAA 
VP ITEKSNPLTQDLDKAEAFJJIVRLLG 

QRLYSF^ ILTTMVQVAGKVQEVI JCBPDGGX*VVI^GGGTSGJRMAF 
XJ^SVSFNQIMKt^iGQKPXjxTYLIAGGD 

IEELKKVAAGKKRVIVIG I SVGLSAPFVAGQMDCCMNNTAVFI.P 
VLVGFNPVSMARHPFPPPRILRSLTVFPSLRAPHYQI TSLLFSM 
SWTLISB 


6722 


1 


390 


RSWSKRTWQALPMAVI^t^FL(^TPO>ADiMQAIYVALGEAVE 
LPCPSPSTLHGBKHLSWFCSPAAGSFTTLVAQVQVGRPAPDPGK 
PGRESRLRIXGNYSLWLEGSKEEDAGRYWCAVIiGQHHNYO^W 


™*723 ■ 


173 


659 


VCQYCTARMADFGISAGQWAVVVro^ 

GNEGRVSVENIKQLLQSAHKESSFDI ILSGl,VPGSTTIiH5AEIIi 
AEIAR I IiRPGGCLFLKEPVETAVDRNS KVKTASKLCSALTLSGT. 
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NO: 
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nucl eoti.de 
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to first 
amino acid 
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amino acid 
sequence 
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location 
corresponding 
to first 
amino add 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine f DsAspartic Acid, E= 
Glutamic Acid, F= Phenylalanine , G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L-I»eucine, M=Methionine, N^Asparagine , 
P=s Proline, Q=Glut amine, R^Arginine, 
S=*Serine, T=Threonine, V=Valine, 
W ^Tryptophan., Y- Tyro sine, X-Unknown, **=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VKVKELQREPLiTPKBYQSVRBHLGHESDNL 


6724 


173 


659 


V CX)V CTTARMADTOISAGQFVAVVWDICSS PVEALKGLVDKLQAI»T 
GNKGRVSVBNIKQIJ^SAHKBSSFDIILSGLVPGSTTIiHSAEIL 
AE I ARILRPGGCLFLKE P VETAVDNNSRVKTASKLCSALTLS GL 
VKVKBIiQRBPliTPEEVQSVRKHI^HESDWL 


6725 


356 


722 


RJ^TPPVlIATMDDDLMLfc^ 
ASWELVDPTPDIXBLFVQFNIX)PFWG^ 
1 LSI i^lu^nCS LKl^aPuLtKliK 


6726 


98 


714 


HI^KMBRKI»RRBKBKBYEGKmSlJ2D^^^ 

GYLY ITQKQTLT KX PDTFLEGI VNGKI LCPFDADGHY P I DRDGIi 

ZiFRHVUTFIiRNfGEIiLLPBG PRKNQMjAQEAEFPQLRGIAEEVKS 

RWKKEQLTPRB'WriiE ITONHDRSQGliRIFCMAPDFIS KIK5RX 

VLVSKSRLDGFPEBFSISSNIIQFKYFIK 


6727 


1 


831 


FRGMGDERPHYYGKHGTPQKYDPTFKGPIYNRGCTDliCCVFLL 
LAIVGYVAVGI IAWTHGDPRKV1YPTDSRGEFCGQKGTKNENKP 
YLFYFNIVKCAS PLVLL^FQCPTPQI CVEKCPDRYLTYLNARSS 
RDPSYYlCQFC\m3FKNNKGVAEVLRDGDCP AVLI PSKPLARRCF 
PAXHAYXGVLMVGlIETrYBrK3HGSRK^ 
RQIiAI^IFBDYWSWYWDIISI^IAMAMSLLFIIIiIiRFI^ 
RtM I IMGILVLGY 


6728 


486 


935 


FCSSWLRSUAOSSLSWKMFIjVGI.TGGrASGKSSVIQVFQQLGCA 
VI DVDVMARHVVQ PGYPAHRRIVK VFGTBVLLBNGD INRKVLGD 
LI FNQPDRRQLJJQMTHPE IRKEMMKBTTFKYFLRBPRTS PRGKK 
HVPSALKEADSLMRRDT 


€729 


259 


1191 


VGLTGAQSGRTASMGRDQRAVAGPAIiRR^ 
LAGYKKFDVTCGGRIXISGGCQCrYPE 

G^QG F PGLQGRKGDKGERGAPGVTGP KGDVGARG VSGFPGADG I 
PGHPG<X3GPRGRPGYIXKWGTQGDSGPQGPPGSEGFTCPPGPQG 
PKGGKGEPYALPKEERDRYRGEPGBPGLVGFQGPPGRPGHVGQM 
GPVGAPGRPGPPGPPGPXGQ<X5NKGLGPYGVKGEKGDVGQPGPN 
GIPSOTIiHPIlAPTGVTPHPDQYKGBECGSEGEPGIRGISLKGEE 
GIM 




784 


1015 


NMVDYYEVLGLQRYASPEDIKKAYHKVALKWHPDKNPENKEEAE 
RKFKK VAEA YBVLSNDEKRD I YDKYGTEGLNEF 


6731 




446 


RTRKWT iHf4A\/VPPVR\yil^r^WT?TPR.QT?rrVITT .RRPTQPT .KNNDRttR 

r^IYAGI^2AVSDSASKSCVPSRNCI.DLYBEXLTEEGT3uIEATY 

NDLQVKYGKCQI^MKELMKK^ 

LIKTARVEtNRKDBBr 


6732 


102 


1205 


GRWQRRPPPPSPPLWCLQPGGGSDPQQIiTQIiRHCLSHSPQDTPW 

AQRQ VCYTAATTQAAAPATRNCLPDHSGHRPTP PRS HRHHRQRN 

LGSIKPSSRSTKATSTIT^GIXSRRAEAVREGWGVYVTPRAPIRE 

GRGRIiAPQNGGSSDAPAYRTPPSRQGRREVRFSDEPPBVYGDFB 

PLVAKERSPVGKRTRLEEFRS DSAKEBVRESAYYLRSRQRRQPR 

PQETEEMKTRRTTRLOXK^EQPPLQPSPVHTRRGIiRDSHSSBE 

DE^SG^I^OTISKKTVRSIQEAPAVSED^^ 

YBATSV0^KVNFSEEGETEFJDI>3DSSHSSVTTVKARSRDSDESG 

DKTTRSSSQYIESFW 


6733 


613 


1311 


RSCRQVGMRSRNQGGESASDGHISCPKPSI IGNAGKKSLSEDAK 

KKKKSNRKEIH3VMASGTVKRHLKTSGECER 

QI^IMEGEI^AREDVTHMLKTEKTKPEVLEAfrYGSAEPEKVLR 

VLHRDAI IAQEKS IGEDVYEKP I SEJjDRLEEKQKETYRRMIxEQLi 

LLAEK(^RrVYEliEWEKHKirTDrTO«CS 

IiLEQEKAYQARKE 


6734 


183 


551 


SAAMFPWSGCFQEXQEKNK5LELVSFEBVAVHFWBEWQDIiDD 
AQRTLYRDVMIiET YS SLVSLGHCI TKPEM I FKLEQGAEPWIVBE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(Alanine, C= Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine. G=Glycine, 
H=Histidine r I=*Isoleucine, K«Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine , R=Arginine, 
S-Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y»TyroBine, X-Unknown , *=»Stop 
Codon, ./-.possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLNLRLSGGS KKQVFSGICHRSIiVBLQBVHLV 


6735 


280 


558 


KSRRAGVTKMSNPFLKQVFNKDKTFRPKRKFKPGTQRPEIiHKKA 

QASLNAGI^IJRLAVQLPPGEDIJJDWAVHVV^ 

XDGCT 


6736 


195 


808 


MNYEI^PKREMPNIKSI^LTNLNFIiLKRLSSVIiPLlTDYVyFBN 
SSSNPYLIRRI EKLNKTASGNVEAKVVCFYRRRDI SNTLIMLAD 
KHAKE IEEES ETTVKADIjTDKQKHQLKHRELPLSRQYKS LPATH 
IRGKCSVALLNEl^SVI^YIJDKEDTKFYSLVYDPSLKTLIADKG 
BIRVGPRYQADIPEMLLEGTFPCVPAVL 


6737 


ISO 


1209 


PVIMPLHFS PGDI VRPS CCVSS SPKLRRNAHSRIiE5YRPDTDLS 
REDK^^HISDRENIDDI^^FNPSDHPRASTIFLSKSOTTDV 
REKRKSLFINHHPPGQIARKYS3CSTIFLDD3WSQPNLKYTIK 

PEQKQIYRFVRTIiFSAAObTAECAIVlXVyi 1 ERI J LTYAErDICP 
ANWKRIVI^AIIiI^KVWnrXlAVWNVDYCX}IUro 
RQFLELl^FNINVPSSVYAKYYFDLRSLARANNLSFPLEPI^RE 
PJmjQ^EAISRr>OTDKYKI)LRRSARKRSASADNLTLPRWSPAIIS 


673B 


148 


653 


CACAEQPARAEVGAATALPVRWASGEa^APSGSLAVPLAVLVLLL 
WGAP WTHGRRSNVRVI TDENWRBIiIiEGDWMI E FYAP WCPACQNIj 
OP E WESFABW3EDLEVN IAKVDVTEQPGLSGRFI ITALPTIYHC 
KDGBFRRYQGPRTKKDFINFIS0KEWKSIBPVSSWF 


6739 


3 


631 


AlffiESSSKSFISRIiAIVADLYEQEQYSDI^IKVGDRHISAHKF 
VLAAR^DSWSIJUnjSSTKELDI^DANPEVIMTMIJlWlYTDELEF 
REDDVFLTEI^KLANRFQI^LLP^CEKGVMSLVNVRNCIRPYQ 
TAEELNASTLMNYCAEI IASHWV9EVEGVKKAL 


6740 


3 


631 


SV*PDMAEEEVAMiEKHIjMi»IjRQEYSnQjQKKIiAETEKRCAL 
ANK^SSSESFlSRiliAIVADLYEQEQYSDLKIKVGDRHISAHKF 
VLAARSDSWSLANLSSTKELDI^DANPEVTMTMIjRWIYTDELEF 
REDDVFLTE LMKIANRFQIjQIJ^HRCEKGVMS LVNVRNC I RF YQ 
XAEEimSTLMNYOVRIIASHWVSEVEGVNKAL 


6741 


141 


960 ■ 


PLTLPFSSRARAGHTMNT6 PGTVGSDPVILATAGYPHTVRFWQA 
HSG I CTRTVQRQDSQVNAIiEVTPDR S MIAAAVQ P VS LG YQH I RM 
YDLNSNNPNP1 ISYIX5VNKNIASVGFilBDGRWMYTGGEDCTARI 
WDLRSRNLQCQRIFQVNAP INCVCLHPNQAEI.IVGDOSGAIHIW 
DLKTDHNTEQLI PEPEVSITSAHIBPDASYMAAVNSTLVPPS CLIi 
PlAIGIliQ^KFESIJ^RGLLFI^CO^CYVWNLTX^IGD 
LIPKTKIP 


6742 


141 


960 


PLTLPFSSRARAGHTMNTSPGTVGSDPVILATAGYDHTVRFWQA 

HSG I CTRTVQHQDSQVNALEVTPDRSMIAAAVQP VSLGYQH I RM 

YDLNSNNPNP I ISYDGVNKNIAS^FHEDGRWWTGGEDCTARI 

WD t^RNLQCQRI FQVNAP INCVCXH PNOAEL I VGDQ^^ 

DLKTDHNEQLIPEPETVSITSAHIDPDASYMAAWSTLVPFSCLL 

PLAIGIIiQKGKFESLAPJ^IiLFIJiCQG 

IiIPKTKIP 


6743 


1 


412 


MHSTQDBCSimEGDPNPSAAPTSTCAPRKMPKRIS ISKQLASVK 

ALRKCSDI£KAIATTALIFRNSSDSIX3KLB^ 

FAEGQET KPKYRE I LSELDEHTEIWIiDFEDFMILIJjS ITVMSDL» 

LQNIR ' 


6744 


95 


1343 


rtparnrcagcevls r fss pnkass falqsaggglp avralrrd 

RQKVSTVG YGMDEVEQDQHEARLKKLPDS FDTTGTGSLGQEELT 
DLCHMI*SLEEVAPVLQQTLI*QI>^^ 

LSNEBHFQEPDCS L KAQPKYVRGGKR YGRRSLPBFQESVEEFPE 
VTVIBPIxDBEARPSHIPAGDCSEHWKTORSEBYEAEGQLRFWNP 
DDIiNASQSGSSPPQDWT EEKLQEVCEDI^lTRDGHLNRKKLVS I 
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ID 
NO: 


Predicted 
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corre spondi ng 
to first 
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amino acid 
sequence 


Predicted end 
nucleotide 

corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C«Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F— Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K«I»ysine, 
L=Leucine, M=*Methionine, N=Asparagine, 
P=Proline, Q=Glut amine , RsArginine, 
S=Serine, T«Threonine, V^Valine, 

W^-HPwrvifr- rvr^H^n V — Twmi tip 5f=rintcnowr\ * — i Q H on 

Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PSASTPYRQIiKRHLSMQSFDESGRRTTTSSAMTSTIGPRVPSCL 
TLALBNELLVTKNSIHQACI 


6745 


1 


5B8 


TFRDQ^WAQRRRWIJ.GCASWESWKAAIAAGPGIiP 
AAGTECFAAVWARGTAMGSVI^TDSGKSAPASATARALERRRDP 

jjlifrV J OF Iff. ft"^-ilJnVf I Kl|linVPL.Kat — miOJUaWWWI J-*— 

PYCMYLPSEGVPATDVAKRMKSEYKHCAEOm,VCLSEWRA^ 
RTCQKYXDIGfGPLQKLSETA 


6746 


110 


492 


GATGAMAKSAPARHRRKKRSTPLTSSTLPSQATBKSSYFOTTEI 
S LWTWAAIQAVBWCMBSQAARWSLEGRTGTAEKKIiADCBKMA 

imivwYT is/vvtdiktrr nwr TrtDVrtT TJ^ODT.CTWITCKrT^T^WKroKr 

VKFG^K}i-»EGKWAVlJGi UJL^tiXKiLiLAJ,titUjtSti VIWLyUxuNKJH 


6747 


247 


484 


KAVTPKDVAWPTBEEI^J^DLAOJ^iYRDVMLKNFRNIjLSVGH 
QPFHRDTFBFLREEKFWMMDIATQREGNSVYAGVC 


6748 


201 


665 


^^^^FKEAVTFKDVAVVFTFJ3BI^ 

LSVGNQP FHQDTFHFLGKE KPWKMK'l'TS QREGNSGGKI Q I EMET 
VPEAGPHEBWSCQQIWEQIASDLTRSQNSIRNSSQFPKEGDVPC 
QIEARLSISXVQQXPYRCNECKQ 


6749 


95 


719 


RREVKGGDGVCPRARG3POSO^FPSC7AGGGEGU3QSGEALIX^ 
SAGGPCPAAAGGGPGGASCSVGAPGGVSM FRWLEVLEKEFDKAF 
VDVDDLLGE I DPDQAD I TYEGRQKMTS LSSCFAQLCRKRQ9VSQ 
INHKLEAQLVDLKSKLTEUXJAEKVVLBKEVHDQIXQLHS IQLQL 
HAKTGQSADS GTIKAKLSGPS VEELE RELKAN 


6750 


3 


428 


SCESRRPGATO^ASGALPRDTTGLGSEQPSGDVAQSNRATMGT 

TAPGPIHIJ^IiCDQKLMEFLCNMDNKDDVWLE 

EPS KEPELM PKT PSQKNRRKKRR I SYVQDBNRDPIRRRLSRRKS 

RSSQLSSRR 


6751 


152 


1417 


PT KATEMAGAS VKVAVRVR P FNS REMSR DS KC 1 1 QMSGSTTTI V 

NPKQP KKTP KS FS FDYS YWSHTS PEDINYASQKQVYRD I GE EML 

QH7VFEGYNVCI FAYGG/I^AGKSYTMMGKQBKDQQGI IPQLCEDL 

FSRINDrTbTONMSYSVEVSYMEIYC^RVRDLLNPKNKGNIiRVRE 

HPLLGPYVEDLSKLAWSYlTOIQDIiMDSGNK^ 

SRSHAVFNI I FTQKRHDAETNITTBKVSKISLVDIiAGSBRADST 

GAKGTRLKEXlANINKSIiTrr/SK^SAIiAEMD 

FI P YRDSVI1TWLI1RENLGGNSRTAMVAAI1S PADI N YDETLSTLU 

YADRAKQIRC^VTNBDPNNKLIRBLKDBVTRIJiDIiLYAQGriGD 

ITDHTNALVGMSPSSSI>SALSSRNV 


6752 


24 


1834 


RNCVPPLGC YRSRVKFHSD I KMQYSHHCF^LIiERXiNKOREAGFli 
CTCTIVTGEFQPKAHRNVIASFSEYFGArYRSTSENW 

KIKMEDFAFITVNPSSTBISSITGNIEIjNG^^ 

SEVSTDIj IQANP KO^ALAKKSSQTKKKKKAFNS PKTGQNKTVQY 

PSDIIiENASVBXFLlWKLPTPVVl^VA^ 

TFPAQDIVHTVTVTOiKRGKSQPNCALKEHSMSNIASVKS PYBAE 
NSGKKLlX^RYSKA^fOiTCGKVFSFASSLRRH^IHKGVKPYV 
CHLOGTCAFTQCNQLKTHVRTO 

HSRMHHGEEKPYKCDVCNIXJFATSSI^KIHARKHSGEKPYVCDR 
CGORFAQASTLTYHVRRHTGE KP YVCDTCGKAFAVS S S L ITHS R 
KHTGEKPFICEIiCGNSYTDI KNliKlulKrKVH^ADKTLDS SARD 
H1X5EQDS IQKSPi^ETMDVKI'SDMTLPI^u^I^TEDHHMLLPV 
TDTQSPTSDTlxLP^TVNGYSEPOLIFI'QQLY 


6753 


2 


1305 


VPSLPYPPQKWAHTEFTTSSDSETANGIAKPDPVMPGGEEKA3 
prciKI^TNYSIxRl^aXKiASQKKKia^ 

SARGEKEMEGVALK1R5PSLPQBRKGAPSTRRBSAEPSSSRSVPV 
AHPGPP PASSQT PAPEHDKAANKMPLAQKPALAP KPTSQTP PAS 



547 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
emu.no acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C** Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G-Glycine f 
H^Histidine, I»Ieoleucine, KoLysine, 
X^Xieucixne , t4~Methiomne , NsAspar&gine , 
P«Proline, Q»Glutamine, R^Arginine, 
S=*Serine, T=Threonine, V=»Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown , *«=6top 
Codon, /— possibXe nucleotide deletion, 
\=possible nucleotide insertion) 








PLSKLSRPYLVELLSRRAGRPDPEPSEPSKBDQESSDRRPPSPP 
6PBERKGQKRDEEEEATBRKPASPPIJPATQQEKPSQTPBAGRKE 
K PMLQSRHS LDGSKLTEKVETAQ PLWITLALQKQKGFREQQATR 
EERKQ ARKAKQAEKLS KENVSVS VQP6SSSVSRAGSI»HKSTAI>P 
EEKRPETAVSRLERREQLKKANTI.PTSVTVEISYSSPAAPLVKB 
VSKRPSSPDBI^VSSEPAWlJ^AKRKAKnWSDCPLIIK 


6754 


2 


413 


FVHRRRRRJL^GPEVNTMSSIJiK5RlADFCH)VLKBPSIAI*EXliRE 
I^PSGlPCEGGLRCbCWKILLNYLPLKRASHTS IliAKQRBLYAQ 
PLREMI IQ PG I AKANMG VSREI5VTFEDHPLNPNPDSRWNTYFKD 
NEVLL 


6*755 


298 


1343 


PGltQI^VALKADWFLDMPGGRRGPSRQQliSRSALPSLQTLVGGG 
C5GNGTGLRNRNGSAIGLPVPPITALITPGPVRHCQIPDLPVDGS 
I^PRPLFFIYIJ.VALFIQYINIYKTVWWYPYNHPASCTSLNFHL 
IDYHLAAPITVMLARRLVHALISEATKAGAASM IHYMVL ISARIi 
VIJ/ri^WVLCWTLVI^FRSHSVI^^ 

DSRAHLIJjTDYNYWQHEAVEESASTVGGIAKSKDFIiSLI>rjESL 
KE0 PNNATP I PTHSCPLS PDDIRNE VECIiKADFNHR IKEVL FNS 
LFSAYYVAFLP LCWKVSGY I.TPTICFIJDLCrVNY BJWVFL.V 


6756 


180 


754 


IBRALGSLPLS I PVSWGSLRTLKYQQQPLRPKVLLCQTRVQCHD 
IJ^LQPQPPGLKQSFCLRVI^LQTGATTPGLRDI.TCKEIil ILTE 
REAQKRKKRKEKESGMAiTQGPLTFRDVAIEFSQEEWKSLDPVQ 
KAr.YWDV^ENYRNLVPliGKI>NFALEVKICPRVFLYKLCCLSWK 
PPHYLTETEALLTHK 


6757 


2 


459 , 


NSRVEAPEAHSRBSQGSDAMRKHI^WWWIA'TVCMLLFSHLSAVQ 
TRGIKHRIKWNRKALPSTAQIYEAQVAE^PGAPXKQGRKLDID 
FGABGNRYYEANY^QFPIXSIHYNGCSEANVTKEAFVTGCINA 
AANQGBFQKPDNKLHQQVLW 


6758 


1 


1008 


ASGPEliPGRRPRDRAPWnPARLLRGVIiAVWVSLSALGPGSFCRR 
RVPSLAQLGHSBAAPSPDDVRWSRVPDRCPEHRDRAWPPPPPPS 
LPPS FRRNMANNSPAIiTGNS QPQHOAAAAAAQQQQQOGGGGATK 
PAVSGRQGNVLPLWGNEKTMNLNPMI^ 

rYHEVVDEIYFKVTHVEP^EKGSRKTAGO/TGMC^^ I 
VSTAPCMjYIOjPTLKLTRRQVMGIiITHTDS P YIRALGFMYIRYT 
QP PTDLWIWFES FLDDEKDLDVKAGGG CVMTIGEMLiRSFLTKIiE 
WPSlXPPRIPVPVQKNriDQQXKTRPRKI 


6759 


1 


513 


RKHNFB^I^XSTOAPHPQTGLPLI^SPVPQRIcroSGCT 
l*iai>K3FSSRSPRPCLNIEDDPDIHEKPFLSSSAPPITSLSLLG 
NFEESVIilfiniFI?PI^I VDGFtCAEVGASGAFCP^ FY 
SVSDDl^SPYl<GVITLESI^KRGYRVPPSGTiaVVCVl» 


6760 




606 


VLS KKKGLSAEEKRTRMME I FSBTKDVFQIiKDLBKI AP KE KG IT 
AMS VKEVI.QS LVDDGMVDCER IGTSNYYWAFPSKALHARKIIKIiE 
VLESQLSEGSQKHASLQKSIEKAKIGRCBTEERT 


6761 


29 


1733 


ERTI^IJlEVAAPSDVADAAVSRRGRCXCa^CTO^VAQDCPS 
SSSSVQRCELSLFQSLHTMTSKKLVNSVAGCADDALAGLVACNP 
NLQIaLQGHRVALRSDLDS LKGRVALLSGGGSGIIEPAHAGFIGKG 
WLTGVIAGAVFTSPAVGSIIAAIRAVAQ^ 
RI*NFGIAREQARAEG I P VBMVVIGDDSAFTVLKKAGRRGLCGTV 
LI HKVAGALAEAGVGLE E IAKQ VNVVTKAMGTLGVSLSS CS VPG 
S K PTFELSADEVELGLG IHGEAG VRRI KMATADE IVKLMLDHMT 
NTTNASHVP VQPG3S VVMMVNWLGGLS FLELGI IADATVRSLEG 
RG VKIARALVGTFMS ALEMPG I SLTLbLVDE PLLKL IDAS TTAA 
AWPNVAAVS ITGRKRSRVAPAHPQEAPDSTAAGGSASKRMALVL 

PP PAS PAQLLS IG^Sin>LLEJa>K^S SGALYGLFLTAAAQPLKAKT 
SLPAWSAAMDAGLEAMQKYGKAAPGDRTMLDSLHAAGQBL 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
Sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=»Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F- Phenyl alanine, G=Glycine r 
H«Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P*=Proline, Q=Glutamine, R^Arginine, 
S^Serine, T=Threonine, V= Valine, 
W=Tryptophan , Y=Tyrosine, X-- Unknown, *=Stpp 
Codon, /-possible nucleotide deletion, 
\-possible nucleotide insertion) 


6762 


3 


613 


ASTI SWRLCVAGABARRP VPVAGKRAGGGAMWFMYLLS WLSLFI 
QVAFITLAVAAGLYYLAKLIEEYTVATSRI I KYM IWFS TAVh IG 
LYVFERFPTSMIGVOLFTNLVYFGLLQTFPFIMLTSPNFILSCG 
IiWVNHYLAFQFFABK Y Y p FS KVLAYFTFCLW 1 1 PFAFFVSLSA 
G SNVL PSTMQ PG DDWS N Y FTKG KRG K 


6763 


2 


760 


SGPDFPGRKFRGCCCWPPAGAGMELGGHWDMNSAPRLV3EXAB 
RKQEQKTGTKAEAADSGAVGARRFLLCLYLGGFLDLFGVSMVVP 
IMLHVKSW^PTVAGIVGSSYGII^bFSSTLVGCWSDVV^ 
S SLLACIIiLSALGYIjLLGAATNVFL fvlarvpagi fkhtls isr 
ALLSDVVPBKBRPLVIGHFNTASGVGFI IjGPWGGYI.TEIjKDGF 
YLTAF I CFIiVFI LNAGLVW FFPRREAKPGSTE 


6764 


80 




T^KKMDThWLS VRNLPBQI>VRRVK I LS BGNEVQF I QIAKD FBDFR 
KKWQRTDHR LG KYKDLLM K AJ^TTERSAIJDVKLKIIARNQVDVB I KR 
P^RABADCEKr^R^IQLIRKMLMCDTS03IQ 


6765 


3 


550 


ARYSRVDHFCRRRORAVARAPRFIjIjQFP^ 
RGSVIiVSEAI^GSAKDGIVTEVAVUVKRGSDELLSGSVI»SSPN5 
NMSSMVVTANGNDSKKFKGEDKMIX^ J 
VIJU^LPFGKVTNILMIJCGKNQAF^ j 
PHLRNQ i 


£7(U 


X 


1287 


EGGSFKASLTH1.WPLGBMKLHCEVEVI SRHIjPALGIjRNRGKGVR 

AVLSIiCQQTSRSQPPVRAFIiLISTLKDKRGTRYELRENIEQFFT 

KP^BaKATVRLXBPPVDlCl^KT^SSSIJCGFIiSAMRIiAHRG^ 

ViyrPVSTLTPVKTSEFEKFKTKWVITSKKDYPLSIQrFPYSLKHL 

QTTSYCX3LVRVDMRMLCI)KSLRKLDLSHNHIKKIiPATIGDLIHLQ 

EIJ^DNHI^FSVALCHSTIOKSLWSU^ 

LQELKNLKLDDNELIQFPCKIGQLINLRFI^^ 

mhS IJKYIiDLFGNTFEQPXVLPVI KLQAPLTLLESS ARTI LHNR 

IPYGSHIIPFBX(^DLDTAKICVa?RFCI^SFI<KJTTTMNLHSV 

AHTWI^VDNLGGrEAPI I SYFCSLGCYVNSSDI 


6767 


336 


919 


APH I CLCSSDIiQFR YKEAFLRDRGLQ I GYCSVDDDPRMKHFLNV 
GRLQSDNEYKKDFAKSRS QFHSSTDQPGIXC^KRSQQriAS DVHY 
RQPLPXJPTCDPEQLGLRHAQKAHQLQSIJVKYKSDLNLTRGVGWT 
PPGSYKVEMARRAAEXANARGLGLO<^YRGAEAVEAGI^IQSGEV 
NPDATK r LHVKKKKALLX. 


*76*r: 


2 


363 


PGSTISCxI*I>SEG8I»PU^VA£X^BKHRAP^ 

LRLSPTDLGSCPPCGPCPIPKPAARGRRO^O^WGKSDERLLOAV 

F^rmPRVAAI>TARKCLVPTKIJ)PEGKSAFHL 


6769 


284 


396 


MSTPDFSTAENWQELANBVSCIJ(AMIiTLMI«OAMGQA^ 


6770 


1 


397 


QRNYQVIWSS-mAXIiHDYYKDEWKKJ^RFWaiSVMQVPRVEK 
ITLHMGVGEAIADKKLLDNAA7y3LAAI SGQKPLITKARKSVAGF 
KIRQGYP IGCKVTLROFJmWEFFHRLlTIAVPRIRDFRGLSAKS 


6771 


3 


376 


APAGTIiAMTCKSVKDVDRYQAVIiANLLLE^ 

WASWNIGVFICIRGtfJIHRNl/TVHlS^^ 

QKl^NGKANRLYEAYLPETFRRPQIDPYLF?7SNLEG 


6772 


1 


1400 


AAAFLQGMTVNGFINXVITSl>\ERRYDLHSYQSGIiXASSYDIAA 

CLCLTFVSYFGGSG\HKPRWIX5WGR\VLMGTGSLVPALPH 

P* *gwkldagvrtcpanpr\ PVCAG\HTSGLSRYQLVFMLGQFL 

HGVGAT PLYTLGVTYIiDENVKSSCSP I YIAI FYTAAI LGPAAGY 

LIGGA1LNI YTKMGRJ^TELTTESPLWVGAWWVGFIxGSGAAAFFT 

AVPILGYPROJjPGSQRYAVMRAAEMHQLKDSSRGEASNPDFGKT 

IRDLPLSIWIAI^PTFIIXCIAGATFATLITGMSTFSPKFLES 

QFS^ASEAATLF<JYXWPAGGGGTFIX3GFFVNKLRLRGSAVI K 

FO^CTVVSIMILWSIiHCPSVPMAGVTASYG^ 

TAPCNAACS C^PBHYSPVCGSIXSLMYFSI^IAGCPAATEITWDG 

QKVYRDCS CI PQNLSSGFGHATAGKCTST 
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SEQ 
ID 


Predicted 
beginning 

nUCJlcOLXQc 

location 

J- C O JJUJ_H_LLlJy 

to first 
amino acid 
residue of 
amino acid, 
sequence 


Predicted end 
nucleotide 

XOCa W JLVU1 

corresponding 
to first 
amino acid 
residue of 
amino acid 
Sequence 


Amino acid segment containing signal peptide 
(A^ Alanine, (^Cysteine, D=Aspartic Acid, R= 
uj.ucaTuxc flcia, r — irntuiyxa_Lanxne F b-oiycine f 
H=Ristidine, I=Isoleucine, K^Lysine , 

T.^T.^iif *i no !VI— -M<>h Vi"i nni no K^&cnav^irri no 

P=Proline, Q=Glutamine, R-Jlrginine, 
S^Serine, T=Threonine, VWValine, 
W«Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion f 
\=possible nucleotide insertion) 


6773 


1 


630 


PWRAPKEEKYKAEEHTVVLTVTGBPCHFPFQYHRQI>YHKCTHKG 
RPGPQPWCATTPNFDQDQRWGYCDEPKKVKDHCSKHSPCQKGGT 
CVNMPSGPHCLC PQHI»TGNHCQ KBKCFEPQLLRFPHKNE I WYRT 
E(^VARCQCH(GPDAHCC^iASQAC^TNPCIflGGRCLEVEGHRIi 
CHCPVG YTGPFCD VGE * GSGASRRPAPRWDGLAR 


6774 


146 


389 


LTBI^DQQY?LFPII^S/WVPTFI*SMDVDGRVIKADSFSKIISS 
GLRJGFLTCPKPLIERVTLK1QVSTLHPSTPNQLMISQ 


6775 


104 


614 


TCPSQLHVLTARGGRRAPSPQLWTIjVIJa^ISSKV^SHRII^RMNS 
GRPE1WENLPALYTIFQGEVAMVTDYGAFI KI PGCRKQGLVHRT 
HMSSCRVDKPSEIVI>VGDKVWVXLIGREMKNDRIKVSZ>SMKVVN 
0^rreKDLDPNNV\SLSKKRGGGDPSRlTLGRRSPLRLS 


6776 


3 


1108 


HERHRRHEGAJjSQDALLRIS 1PLDSNMRPEKCRRFVHPQWQLLU 
I^TFPKTSDAO^BPC^GKVYDRISFSSTIVTEWDLVCDSQSI. 
TSVAKFVFttAGMMVGGII^HLSIIRFGRRPVIJWCYLQVAIVGT 
0\ALAPa , FLIYCSLRFT J SG:U'AMSI^ 

GITIX>!CP£K;IAF^mJ^I>AFAIRDWiIILQLVVSVPYFVIFLTS 
S W LLESARWL I INNKPEEGLKELRKAAHRSGMKNARDTLTDBIIb 
KSTMKKELEAAQKKKPFLGBRLHMPNICKRISIJjPFI'KFANFKA 
YFGItfLHG/LKHLGNNVFIJ^ 
RVSSRGRVNCLGLFVLQVW 


6777 


779 


63 

- 


cffhgpawrdcbvratfakkqgqsgiisciafspaqplyaogsy 

GRSLGLYAVTODGSPIiAI.IjGGHOjGGITHLCFHPDGNRFFSG 
AEiLCWDLRQSGYPLWSLGREVTTNQRiyFDIiDPTGQFLVSGST 
SGAVSVWim^PGlMKPEPVJ^FLPQKDCTNGVSIJlPSLPLrX? 
HCI»PV5VCPLS PTESGGRRRGAGPSLGS PRRHVHLBCRLQLWWC 
GGGARLQHP* * SPRARKGR 


6778 


311 


805 


IQSITDBSRGSIRRKl^AlSrrRI»RLNVP\BBTAGDSE/ERSPBEB 
VQADPRIRS AS PKCFTS S PFPKGRS PEG BGET \ D PE KVHFHPGP 
KDKSVAEKN\KGP\SPVSSBGI KDFFSMKPEWENLNQSNVRRMH 
T\AVRLNEVIVKKSRIlAKLVIiLNMPGPPRKRNGDENY 


6779 


2 


535 


RAIJiRQPRLIiAANGIBPBSMAISEPIKr,SRKPCVNKEBLALK^ 

MAKCAWKGPREPPQDARAEAESPGGASE3t)QIX5GHESPPKKKAV 

AKVSAKNPAPfORKKKKVSLGPVSYVLTOSB^ 

SRREASDQKAPRGQQPAEATASTSRGPKAKPKGSPRRATNESRK 

V 


6780 


3 


403 


HE VNDNKPE ININLMS PGKEEIS YI ?EGDP IDTFVALVRVQDKO 
SG LNG E I VCKIiHGHGHFKIjQKTYENNYL iltnatudke krs b ys 
LTVlAEDRGTPSI^TVKHFTVQINDlNnNPPHFQRSRYEPVISB 
K 


6781 


1 


1269 


APTRPVFPTLQPI^SSKEPSNSLNIiPHSNELCSSLVHPELSEVS 

SNVAPSIPPA^RPVSSSSISTPLPPNQITVFVTSNPITTSANT 

SAALPTHLQSALMSTVVTMPl^SK^^ 

TPVFINSSSIIQVMKpSQPSTIPAAPLTTNSGLMPPSVAVVGPL 

HIPQNIKPSSAPVPPWALSSSPAPNIQTGRPLVLSSRATPVQLP 

SPPCTSSPWPSHPPV(X3Vi<El^DE^PQVNTSAIX}NTI.PSSQ 

STTMVSPIjLTNS PGSSGNRRSPVSS S KGKGKVDKIGQI IiLTKAC 

KKVTGSbEKGEEQYGtfUX3ErrEGOGIJ7^ 

KTPTPPAPTLLKMTSSPVGPGTASAGPSLPGGAI^PTSVRS IVTT 
LVPSELISAVPTTKSMHGGIASBSLAG 


6782 


3 


1327 


RKPWIRIPAKPGKCLHEDPQSPPPLPAEKPIGNTFSTVSGKLS 
NVBRTRNLESNHPGQTGGFVXVPPRLPPRPVNGKTI PTQQP PTK 
VPPERPPPPKL3ATRRSNKKLPFNRSSSDMDLQKKQSNIATGLS 
KAK^QVPKNQDPVLPPRPKPGHPLYSKYMLSVPHGIANEDIVSQ 
NPGEI£CfttGDVLVMIiKO/nniOT 

PTiDKHIiRSRPN PFS PPKAPS HAQKP VDSGAPHAWLHDFP AEQV 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
( A-Alanine , (^Cysteine, t)=Aspartic Acid, E= 
Glutamic Acid, F»Phenyl alanine, G -Glycine, 
H=Histidine, I-Isoleucine, K=I*ysine, 
L=Leueine, M Methionine, N-Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
W-Tryptophan, Y^Tyrosine, X -Unknown, *^Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DDLNLTSGEIVYL^EKIDTDWYRGNCRNQIGIFPAKYVKVI I DI 
PEGGNGKRECVSSHCVKGSRCVARFEY IGEQKDELSFSEGEI 1 1 
LKEYVNEEWARGEVRGRTGI PPLNPVE PVEDYPTSGANVLSTKV 
PLKTKKEDSGSNSQVNSLPAEWCEALHSFTAETSDDLSFKRGDR 
I 


6783 


"~ 3 


1750 


S YHHHHAQQSAAAS PNLTASQKTVTTTSMITTKTIiPLVIiKAATA 

TMOA CW W^WDDTT KMUTB TWCfMtfTLtlT C'l'l>UI >MTPirHT YYT*C C tfl/M» 
Xvlz'Ao V VbUKlr X X nPiv I A JLMoy ft/* V Jjo LUVlJFt X Jtr V WLA/A o i>xL V 1 

GPGAEAVQIVAKOTVTLQVC^PPQPIK^PQFIPPPRI.TPRPNP 
LPOArePKFVAQNBttPIAPAPPPmAAP^ 
PTSONS IHPVRVVNGQ^TIAKTFPMAQLTSIVIATPGTRLAGP 
QTVQLS KPSLBKQTVKSHTETDEKQTESRTITPPAAPKPKREEN 
PGjKLAFMVSLGLVTHDHLEE I QSKRQERKRRTTANPVYSGAVPE 
PERK3CSAVTYLNSTMHPGTRKRGRPPICYl^VIiGFGALTPTS PQS 
SHPDS PENEKTETTFTFPAPVQP VS IiPSPTSTDGDI HEDFCS VC 
RKSGQI^Cim3RVYHI,DCLDPPlVKTI 

KEEAIPWPGTLAJ1VHSYIAYKAAKEBEKQKIXKWSSDLKQEREQ 

LEQKV KjQLSNSIS kcmemkntilarqkemhsslekvkqlirlih 
GIDLSKPVDSEATVGAlSNGPIXn*PPANAATSTPAPSPSSQSCT 
ANOJQGEETK 


6784 


3 


1750 


S YHHHHAQQSAAAS PNLTASQKT\rrTT3MITTimiPLVLKAATA 
TMPAS WGQR PT IAMVTAINSQKAVI^TDVC^TrP VNIjQTSSKVT 
GPGAEAVOIVAKin^mjQVOATPPQPlKWQFIPPPRLTPRPNF 
LPQVRPKPVAQNNX PIAPAPPPMLAAPQLIQRPVMLTKFTPTTL. 
PTSQNS IHPVRWNGQTATIAKTFPMAQLTS IVTATPGTRLAGP 

PQKIJVPMVSI/5LVTHDHLBEIOSKRQBRKRRTTANPVY^ 
PKRKKSAVTYLNSTMHPGTRKRGRPPKYNAVIX3FGALTPTSP 
SHPDSPBNEKTBTTPTFPAPVQPVSIiPS PTSTBGD IHEDFCSVC 
RKSGQIJJMCmCSRVYHI^CIiDPPk^ 

KEBAI PWPGTIAIVHS YIAYKAAKEEEKQKLLKWSSDIjKQERBQ 
LKQKVKQliSNSISKCMEMKNTIIARQKBMHSSLEKVICQLIRljIH 
GIDLSKPVDSBATVGAISNGPDCTPPANAATSTPAPSPSSO^CT 
ANCNQGEBTK 


6785 


1 


52 B 


LGimrUHYCSMYSKPEaLKI»lJ»R8KPT^ 

I»KATOX!EDLLSOAKSGKFNPHVHVBYEWLRQEE IDRSDDDIiDD 
KPSPVKKERSPRPQSFCHSSS IS PQDKLAItPGFSTPRDKQRIiSY 
GAFTNQ I FVSTSTDS PTS PTTEAPPLP PRNAGKGPTGPPITPHR 


6786 


1820 


1397 


RSPKVLVUVPTRBLAMHVSRDFKDl \TRKLTVARFYGGTSYQSQ 

INHIRNGIDILVGTP^IKDHI^SGRIJ^LSKIJIHVVLDEVDQML 

DLGFAEQVEDIIHBSYKTDSEDNPQ^ 

YMKSRYEQVDLDGK>TOKAATTVBH1AIQCHWSQRPA 

VYSGS RGRAI IFCE TKKNVTEMAMNPH IKQNAQCLHGD IAQSQR 

EITI*KGPllEGSFKyLVATNVAARGl J DIPBVDLVTQSSPPQDVES 

YIHRSGRTGRAGRTG ICICFYQPRERGQIiRYVEQKAGITFKRVG 

VPSTMDLVKSKSMDAIRSIASVSYAAVDFKRPSAQRLIEBKGAV 

DAIAAALAHISGASSPKPRSLITSDIDGFVlt4TLESLEElQDVSC 

AWKElJffRKLSSNAVSQITRMCLIJCGMMGVCFDVPT^ 

WHDSDW I LSVPAKXPEI EEYYDGNTSSNSRQRSGWSSGRSGRSG 

RSGGRSGGRSGRQSRQGSRSGSRQDGRRRSGNRNRSRSGGHKRS 

FD*VFYHI*VDFLSDFLVDSVYI J TGRQIDlll>l-GLTGLIDHLTSHS 

SVWN 


6787 


2646 


2270 


PSSFPKNVPLEEIiEEPPK*KRSGLGSLTPKSQIO*fGP* PQTFFF 
FEI^SPSGVISAHCmRI^GSSDSPAPASRVAGIIGTCHHAWLI 
LVFL VBMGFlfflVGOAGlJCLLTLX VI HP P WP PKVLGLQT 


6788 


16 


936 


GGTV DLR \DMLAVS VLAAVRGGR /ATVRRVRESNVIJIEK3KGKT 
REGABCKMTSGDVLSNRKMFYLLKTAFPSVQI*rra 
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SEQ 
ID 
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beginning 
nucleotide 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C^Cygteine, D=Aspartic Acid, E= 
Glutamic Acid, F»Phenylalanine , G-Glycine, 
H~Hi s t idine , I -Isol eucine , K=I»ys ine , 
L=Leucine, M=Methionine, N^Asparagine, 
P=PXOline, Q=Glutamine, R=JVrginine, 
S^Serine, T^Threonine, V=Valine, 
W-Tryptophan, Y=»Tyro8ine, X=Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








EVIIiWGS*DS*GyPKGK*I»LPKEVPSR/RVLLSGIiTPI,DATQB\ 
FTEDLSK\YVTTMVa/AVNGKPMW^ 

WKARSSYNBKTPRIWSRSHSGMVKQVALQTPGNQTTIIPAGG 
AGYKVLALLDVPDKSQBKADLYra 

GIMTTLSGEB I S YTGSDG I EGGLIAS I RMNHQALVRKLPDLE KT 
GHK 


6789 


2 


678 


GKGINVLKIAPESJlIKFMAYEQI *PGDS *GF/YBRI»VA 
GSXiAGAI AQSS I YPMEVIiKTRMAIiRKTGOYSGMLDCARRIIiARK 
GVAAFYKGYVPHMLGIIPYAGIDLAVYKTLKNAWLQHYAVNSAD 
PGVFVLLAOGTMS S TCGQLAS YPLALVRTRMQAQAS IEGAPEVT 
MSSLPKHILRTEGAFGLYI^IJ^NFMKVIPAVSISYVVYSNLKI 
TLGVQSR 


6790 


2 


4066 


APPAGRRRMQAAP RAGOoAALIiLW IVSSCLCRAWTAP STSQKCD 
EPLVSGIiPHVAPSSSSS ISGS YS PGYAKTMKRGGAGGWS PSDSD 
HYOWliQVDFGNRKQISAIATQGR YSS SDWVTQYRMliYSDTGRNW 
KPYHQDGNIWAFPGNINSnG WRHELQHPI IAR YVR I VPLDWNG 
EGRIGLRI BVYGCS YWADVlNFIXWVLPyRFRNKKMKTLKDVI 
AI2JPXTSESEGVIl^GEGQQGI>YITIiBLKKAKLVLSUaiiGSNQL 
GPIYGHTSVMTGSLLDDHHWHSWIERQGRS INLTLDRSMQHPR 
TNGEPD YLDLDYE I TFGG I P FSGKPSS S S RKNFKGCMES HMYNG 
VNITDLARRKKLEPSNVGNT^FSCVEPYTV1>VFFNATSYLEVPG 
RI^QDLFSVSFQFRTWNPJNGLLVFSMFADMLGNVEIDLTESKVG 
VHINITQTKP1SQIDISSGSGIJIDGQWHEVRFIAKKN 
DEASAVRTNS PLQVK*1"GKKYFFGGFLKQMNNSSHSVLQPS FQGC 
MQIilQVDDQLVNLYEVAQRKPGSFANVSIDMCAI IDRCVPNHCE 
HGGKCSQTWDSPKCTCDETGYSGATCHNS IYEPSCEAYKHLGQT 
SNYYWIDPPGSGPl^PIJCVYCNMTEDKVW'riVSHDLQMQTPVVG 
YNPEKYS VTQLVYSASMDQI SAI TDSAEYCEQ YVS YFCKMS RLIi 
NTPDGSPYTWWVGKANBKHYYWGGSGPGIQKCACGIERNCTDPK 
YYCNCDADYKQWRKI^FI*5YKDHLPVSQVVVGm'DRCX5SKAKI J 
SVGPLRCQGDRNYWNAASFPHPSSYliHFSTFQGETSADI SFYFK 
TLT PWGWIiENMGKEDFIKLBI>KSATEVSFS FDVGNGPVEIWU 
S PTPLNDDQWHRVTAERNVKQAS LQVDRLPQQ I RKAPTEGHTRL. 
EhYSQLFVCKiAGGQQGFLGCIRSI^WCn/TLDI^KRAKVTSGFI 
SGCSGHCTSYGTNCENGGKCIJ3RYHGYSCIX!SNTAYDGTFCNK^ 
VG A PFKEG M WLRYNFQAPATN7VRDSS S R VDNAPDQQNS H PDIiAQ 
EEIRFSFSTTKAPCILIjYISSFTTDFIiAVLVKPTGSLQIRYNIjG 
GTREPYtflDVDHRNMANGQPHSVWXTRHEKTrFTxmJHYPS VS Y 
HLPSSSDTLFNSPKSIiFiyGKVTETGKXIXiEIHKYNTPXSFTGCLiS 
RVQFNQIAPLKAAURQTNASAHVHIQGELVESNCGASPLTLSPM 
SSATDPWHIJ>HLDSASADFPYNPGQGQAIRNGVNRNSAI IGGVI 
A\WI FTPSLCTP\VI»P* SR *HVB PHKGTXP I PNBAKGAGS RQK 
KPGRRPSMJINDpPTS<^PlDBSKKEWPIirj?GGYLAMG 


6791 


1801 


1193 


TGH^GAKGBKGDKGDLGPRGERCK2HGPKGBKGYPGIPPKIi/PGW 

C aW* QMT.THaQTirVOftTT.T.DrtOT.PfcmT/^Tli BM&QMlTIIIMlin 
ixtxV v ^ o rt Jj 1 /\rXty X |\ W \J*+ 1 1 »* 'r\Jr JJH IjOI-^ JU/vc r'li\2j I tf\ X fl C oil \J 

NSGII FSSVETNIGNFFPVMTGRFGAP VSGVY FKTFSMMKHEDV 

EEVY^/YLMHJNGNTVFSMYSYEMKGKSOT 

LRMGWGALHGDHQRFSTFAGFLLFRTK 


6792 


33 


1073 


VRHTNWGVDM YLFS LGSBSPKGAIGH I VSTKKT I LAVERNKVLL 
P PLWNRTFS WG FDDFS CC1GS YGSDKVliMTFENliAAWGR CLCAV 
CPS PTTIVTSGTSTWCYWEl^MTKGRPRGl^RQALYGHTQAV 
TCIAASVTFSIiuVSGSQIxnTJILHDLDHLTHVTRi^ ISA 
3TI SDVSGTIVSCAGAHLSLWNVHGQPIiAS ITTAWGPBGAXTCC 
CIjMEGPAWDTSQI I ITGSQDGMVR WKT/VGCEDVCSWTASRRG 
APGSASKPFOIPQVGEEPGLESRAGR* HCFDRBAQQNQP \ PVTAL 
A VS RNHTKLL VGDERGR I FCWSADG * EERGSRG SGTTVPG 
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amino acid 
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amino acid 
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Predicted end 
nucleotide 
location 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid/ B= 
Glutamic Acid, P=> Phenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, K-Lysine, 
L~ Leucine, M^Methionine, N=Asparagine , 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, v^ Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /--^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6793 


2340 


805 


GRKEANY\YGSLlXlAGTVSlX3LnAEGQE^WPFSAVLPMVAPNP 

LVFDGWDISSLNLAEAHRI^KVLDWGLQEXJLffPHMEAIjRPRPSV 

YI PEFIAANQSARADNI»IPG5RAQQLEQIRRDIRDFRSSAGLDK 

VI VLWTANTERFCBVT PGI^DTAKNLLRTIBLGLEVS PSTLFAV 

AS ILEGCAFLNGSPQNTLVPGALEJUAWQHRVFVGGDDFKSGQTK 

VKS VLVDFLlGSGIiKTMS IVSYNHLGNNDGBNLSAPLQFRS KEV 

S KSNWDDM VOSNPVLYTPGBEPDHCWI KYVP YVGDS KRALDE 

YTSBLMW^TinijVIiHNTCBMLIAAPIM^ 

en?^PEPQTFHpVLSIiI»SFIiFKAPLVPPGSPVVNAJUFR 

EN I LRACVGLPPQNHMUjEHKMERP^ PMI jNK 

KGPVPAATNGCrrGDANGHLQEF^PMPTT*GPGHTVSRLFLPAAP 

HDPTIiKAPTNKGRCHFSPPSTWGSWGL 


6794 


169 


134$ 


DDVXRKPEASAH*EKPGPPSRPGVRGGRERAGGRGSHGAR5CR\ 
EPAPPAPAPPEDHPDEEMGFTIDI KS FLKPGEKTYTQRCRIiFVG 
NLPTD ITBBD FKRIiFER YGEPSE VF I NRDRGFGF IRLES RTLAB 
IAKAELDGTI LKSRPLR 1 RFATHGAALTVKNLS FWSNELLEQA 
FSQFGPVEKAVVVVDDRGRATGKGFVEFAAKPPARKALERCX3DG 
AFLLTTTPRPVIVBPMBQFDDEDGLPEKL^^^ 
RFAQPGTFEFBYASRWKMtDHMRKQQRBQVDRHIREAKEKLBAE 

HEKKHRRREEEMIRHRBQEKLRRQQEGFKl^NYMElfYVCHFLR 


679S 


1740 


1010 


GPRRQTQ VRD1 IELPS F* PWAAQBTDCAQHSGERL * KGV/LKNPS 

TMSKSAVKISI^IjLSNPLCEQDQDLIiNM^ 

EKWQIQKTVIE PliKKFXSSVFPSLN^VKRREO^UJDYRRLQ^ 

VEKYEEiraKTGPVIAKLHQAREKLRP^ 

FYGSRLDYFQPS FESLIRAQWYYSEMHKIFGDLS HQLDQPGHS 
DEQRERENEAKLSELRALS IVADD 


6796 


48 


683 


GKEIQlPTIKLAWIiFGIJE* PVGALGKGVVS P+ * SHVALGGLGW 
LTRAVRSSWRWELCVSAQEWSQRSA*3SP3PVGACPSLNPPET 
3VQEGRDCWQR * h PRLFS ALVGQPGCWPQGAPPERCV* PGRCXW 
HIjQSQVLR* ERRRCCRCtiPRFA* GWRRRHQRLGLG IHPAPLGST 
SPPHP»^GQCRR*GWAAELRLPSSVVL*GJKLGC* 


6797 


1620 


211 


TERMTPSQPTRGSSCTRPSSMLWTSTVmCLrCHWAGMRMSVVGV 
TLGPMAQGLLS ASGTTTBATWTRPT^ RWWLLTASR VDPP 
SRPPPPPSDDLTU>ESSSSYKNIi/DAQIPQ/DWSM3PSTSG*RP 
I/TSRASS IMRSRTAXPSAS *SRLTTKHTVGGSPSAWRPRPTSRS 
VSTPVSSSXETTASGSCIiTWWSSSPAPCPSSSAPAHSFEASCCK 
TSLWGSC^GSGtK3SSAOGSGHNI£MAGTSCSSPAMCSPSRAPS* 
RSASRPRTWRATTSAASSWAPRRCWCGWA*SAT*PSSTTTISSS 
PHOGWPCPASCASAAAWLSSTWATASVAGSCWGPIM*-SSAHSPW 
a^CSRSSMGTTCL*RSPP\SGASRAAAAWCGSSPSSTPTPSS 
ASSSTWCSASSSRSSPAPTTPSSIPAAQAQRRASCRPTSHSART 
AP P PAS S AAGAAR PAAFSAAAEGTPRRS I RCW 


6798 


3894 


1696 


STlSWESLESWLNKATNPSMRQEDWBYlIGPCDQINKEI i EG*VS 
ALWGQLRGSGLGRGTTMAKEGQPGS PRLSALECVIlLVPQ\ PQ I A 
VRLLAHKI QSPQEWBALQALTYLGDRVS EKVKTKV I ELLYSWTM 
ALPEEAKIKDAYHMIiKRQGlVQSDPPIPVPRTLIPSPPPRPKNP 
VFDDBEKS RI»IJiKLI»KSKNPDDLQEANKLIKSMVREDEARIQKV 
TKRLHTLEEVNNNVK1*I£BML^ 

ENKRRTL FKLAS ETEDNDHS LGD ILQASDNLSRVI NS Y KT I IEG 

PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNAL 
SNLDEKI^l^LADPAPWPPKESAGNSC^iil^REQSDUJFFS 
PRPGTAACGASDAPI>LQPSAPSSSSSOAPIiPPPFPAPWPASVP 
APSAGSSLFSTGVAPALAPKVEPAVPGHHGI>ALGNSALHHIJ^ 
DQIJl^EEAKVTSGLVKPTTS PL I PTTTPAR PLLP FSTGPGS PL FQ 
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corresponding 
to first 
amino acid 
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amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
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residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, O Cysteine, D^Aspartic Acid, E=* 
i>xutanu.c nciu, r— phenylalanine , b=Giycine , 
H^Histidine, I-Isoleucine, K~Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S -Serine, T=Threonine, V=Valine, 
W«Tryptophan, Y-Tyrosine, X»Unknown, +-Stop 
Codon, /^possible nucleotide deletion, 
\opossible nucleotide insertion) 








PLSFQSQGSPPKGPKLSIiASIHVPLKSIKPSSALFVTAYDKNGK 
RILFHFAKECPPGRPDVLWWSMLNTAPLPVKSlVLQAAVPKS 
MKVHCI^PPSGT^I^PFSPIQPPAAITQVMLIJ^LKKiamLHYK 
LTFAI^EQI»STRVGEV1X3FPPVEQVK5NL 


6799 


3894 


1696 


STISWESLESWLNKATNPSNRQEDWEYI IGFCDQINKELEG* VS 

ALWQQIJlG3GI^RGTTMAKEGQPGSPRLSALKCViajVPQ\PQIA 

VKWjAHKIQSPQEWKALOALTYIiGlDRVSEKVKr^ 

ALPBKAKIKDAYHMIiKRQGrVQSDPPIPVDRTLlPSPPPRPKNP 

WDDBEKSKLLAKLLKSKNPDDI^EANKLIKSMVRKDKARIQKV 

TKRIiHTI*EBVOT«NVRI^BMIiI»HYSQKDSSDGDRJ^ 

KNKimTLFKIASETEDND^ I EG 

OVXlTOEVATI>TLPDSEG^S(^SNQGTLIDLABIiDTTN 

PAPTPPSSQIPILPPPPOASQPPRSRSSSQAEATIiGPSSTSMAL 

SWLDEBLLCLGI^PAPNVPPKBSAGNSQWHiaiQREQSDLDFPS 

PRPGTAACX5ASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 

APSAGSSLFSTGVAPAJuAPKVHPAVPGHHGLALGNSAbHHI.DAL 

IXJI*LEPAKVTSGLVKPTTSPLIPTTTPARPLI,PFSTGPGSPLFQ 

PLSFQSQGSPPKGPELSLASIHVPLESIKPSSALPVTAYDKNGF 

RILFHFAKECPPGRPDVLVVWSMIiNTAPLPVKSIVLQAAVPKS 

MKVTO^PPSGTELSPFSPIQPPAAITQVMIJiAtT^ 

LTFALGEQLSTEVGEVDQFPPVEQKGNL 


6800 


404 


1646 


RRSPSTGLSPVP0PSSPSLSDYSIPWSLLLSGT1AHATPGK*AG 
*PC^*I^IAPA1AFI /GLTRGRKQNKBKMAJEGGSGDVDI3AGDC 
SGARYNDWSDDDDDSNESKS IVWYPPWARIGTEAGTRARARARA 
PJlTRARPJ^VQIOZASPNSDDWLSPQEIj<JKVLCLVEMSEKPYIIiE 
AALIALGNNAAYAFNRDI IRDLGGLP I VAKI LNTRDP I VKEKAL 
IVIJ^nn^VNAENQRRLKVYMNQVCKDTITS 
TNMTVTNEYQHMIiANS I SDFFRLFSAGNEET KLQVLKLIJjNIiAB 
I^A>nrPJELLRAQVPSSIiG\SLFNKKBNKBVX IjKLLVI FENINDN 
FKHEENBPTO^OFGBGSLFFFXKJBFQVCADKVIiGIESHHDFljVK 
VKVGKPMAKtAEHMFPKSQB 


6801 


2 


1755 


SAEKFHSGX^VTMHDVnAESFBVLVDYCYTGRVSLSEANVERL 
YAASD^I^YVREACASFLARRLDLTNCTAII^ 
RSQAQSY lAQNF^LSHMGSIREETliADLTLAQI^VLRLDS^ 
VBSBG^CUl^VOWLEAAPKERGPSAAKVFKCVRWra 
YLEGLLTKP I VK3CyCLDVIEGAIjQMRYGDLl» YK3LVPVPNSSSS 
/R * QQQLS C I CSRKSTPBTGY VCQGDGDIiLVn7PQRSI»S \rydpy 
SGD I Y*IT4PS PLTSFAHTKTVTS SAVCVS PDHD IYLAAQPRKDLW 
VY1G?AG^S WGX}LADRlJUCREGMDVAYIiNGYI ytlggrdpitgvk 
LKE VECYS VQRNQWAL VAP VPHS FYS FBL I WQKYLYAVNS KRM 
LCYDPSHNMWLNCMLKRSDFQBACVFNDBIYCICDIPVMK^ 
PARGBWPJllSNIPl^SETHNYQIVNHI^KI^^ 
VTVYKflyrRKI^WINIGTMLGLIiQFT^ 

VAPQRNAQDQQGSL 


6B02 


157 


1341 


ETFPLFFFLLSKTPGKTASMAHFVQGTSRMIAAESSTRHKECAE 

PSTRKNr^SJ^KIRCI^KQPJOSXJiEVNQQWDQQFRSMKKLYE 

RKVAELKTKLDAAERFLSTREKDPHQRQRKDDRQREDDRQROLT 

RDRIjQREBKEKBRLNBBtHELKEENKliLI^ 

EI KRLNKALQDALNI KCS FSEDCLRKSRVEFCHEEMRTEMEVLK 

QQVQIYEEDFKKERSDRERXiNQEKEELOXJINETSQSQLNRLNSQ 

I KACQMEKEKLEKQLKQMYCPPCNCGLVFHLQDPWVPTGPGAVQ 

KQREHPPDYQW YALDQLPPDVQHKAN/DWCXAPPPVCCQAG/ PR 

TPGLK+SSCLWLPKC+NFRPILSKBSPSVBVHTNRERQQATRER 

G 


6803 


1 


2203 


Ki>SGRPYRHJ«VU3TSKLYDIRKTIFTFTPQFIDQC^FYLAi™ 
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to first 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=*Alanine, C**Cysteine, D=Aspartic Acid, E=» 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, X=Lysine, 
L^Leucine, MnMethionine, N=Asparagine , 
PaProline, Q=Glutamine, R=Arginine, 
S =Ser ine , T^Threonine , V=»Val ine , 
W=Tryptophan, Y^Tyrosine, X=TJnknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








KM I VEMLRTDIjS YIiCSRWRMTGQPTITF P I SHSMLDEDGTSLNS 
S I LAAJjRKMQDGYFGGARVQTGKIjSBFLTTSCCTHLS FMDPGPE 
GKLYSKDYDDNYDYI^SCaWMNDYDSTSHARCGDEVARYLDHLL 
AHTAPHPKLAPTSQKGGI^RFQAAVQTrCD 
JNVHMiI»FUKbrvJAo Jurors 

QSGKVDFKALVLQLKKTSSI^KQADILY^YTr^GPDWNTKI»YN 
KRSATTOELLTEIjYGKVGEIRHWGLIRYISGIIiRKKVK 
TDLI>SHQKHIiTVGLPPEPRBKTI SAPLP YEALTQLIDBASSGDM 
S I SIIjTQK I MVYLAMYMRTQPGLFAEMPRIiRIGLIIQVMATELA 
HSIJ?C^AEEATEGrJ4NI>SPSAMlCMLIiHHII^GKEFW 
PTDSNVSPAISIHEIGAVGATKTERrGIMQLKSEIKQVEPRRLS 
I SAE SQS PGTSMTPSSGSFPSAYDOOS S KDS RQGOWQRRRRI»DG 
AI»NRVPVGFYQKVWKVLQKCHQ1>SVBGFVLPSSTTREMTPGEIK 
F3 VHVES \VLNVLI»RPEYRQI»I»VEAI LVLTMLADI B IBS IGS 1 1 
AVEKIVHI ANDIiFLQBQKTLGP \DDTMIAKDPASG\ I CTLR\ YD 
SAPSGRFGTMTYLS\RAA\ATYVQBPLP\HS ICAMQ 


6804 


1 


951 


GSPGKKEEKAKNKESLCME^SSNSSSDEDEEETKAK^PTKKYN 

laJVWSSIQGQWPKFJTLKELFSDSDTBAAASPPHPAPEEGVAEES 
LQTVAEBBSCSPSVELEKPPPVNVDSKPIF^KTVEVNDRKAEFP 
SSGSNFSA* IPLFYLHIiNRIiHQSL*QKGSROX>SSVTVSBPIAPN 
QBEVRSIKSETDSTIEVDSVAGBIjQDLQSERE*IjASRF*CQCKIj 
KQ* *SAR r TRTS*KSLYRSEKSBRCSGRRKFI KKAEKKP * SNSGK 
QQKEGKRHK 


6805 


1539 


206 


RQPDLKYFGKSFDVSVSESSSLLSNDLPKFADGIKARNRNQNYL 

VPSPVLRILDHTAFSTBKSADIVICDEECDSPRSVNQO/rQEESP 

IEVHTAEDVPXAVEVHAISEDYDIBTENNSSESLQDQTDEEPPA 

KLCTCILDKSQAIjNVTAC&KWPLLRANSSGLYK^ 

DLKQHMILKHIOITDSNVCRVCKESFSTNMLLXBIIAKI^ED 

CKY CDYKTV I FENI>SQH IADTH FSDHLYWCEQCDVQ FS S S S ELY 

LHFX3KHSO)EQYLO)FCEHETNDPEDLHSHVVNEHACKLIELSD 

KYNNGKHGQYSLI^KlTFDKClOfFFV^VC^ 

aiehtkifpiivcdixxi«;fssmi^\iakiilnshlsegiyi>cqyw 
f^stg^ifjjlkihi^fkhsadlphkcsdci^fgnfj^isniip 

VHETT 


6806 


272 


3794 


VAIiCFPNSDPVMFMDAFYGCLIiAF^PVPIEVPLTRKDAGSOX^ 
GKLLGS CXJVFLALTTDACQKGLPKAOT^EVAAFKGWP PI»S WL VI 
TOKHItAKPPKDWKPLAQDTGTGTAYI E YKTS KBGSTVGVTVSHA 
SLLAQ^TRAIiTQACXTYSBAETLT^ 

MHWSVPYALMKANPI^WIQKVCPYKARAALVKSRl^WSIiLAQ 

RGQRHVSLS SLRMLIVADGANPWS I SSCHAFLNVFQSRGIjRPEV 

ICPCA3SPEALWAIRRPPDLGGPPPRKAVLSMNGLSYGVTRVD 

TP^KLSVLTVQDVGX^PGAlWCVVKIiEGTPYL 

SSSATGTAYYGIJ^rnaiVFEAVPVTTG^APIFD 

F IGP DHLVF I VG KLIXjIjM VTGVRRHNADDVVATALAVE PMKFVY 

RGR I AVFS VTVLHDDR IVLVAEQRPDASEKDS FQWMSR VLQAX D 

SIHQVGWCLALVPAM^PKAPLGGIHISOT 

VI*ICPHTCran*PKPRQKQPEVGPASMIVG^ 

FJ^AHTiEDSDQARKFLFLADVLQWRAH^ 

STATCVQIJIKRAERVAAALDffiKGRIiSVGDHVALVYPPGVDLtAA 

FYGCLYCGGVPVTWPPKPQNLGTTLPTVKMIVEV^ 

QAVTm^IJ^KEAAAAVDIRTWPTIIJOTDIPKKKIASVFRPPSP 

DVIAYLDFSVSTTGILAGVKMSHAATSAL^ 

IAlCrJJPYCGLGFAI^CLCSVYSGHO^VLVPPLBLESNVSLWLS 

AVSQ YKARVTFCCYSVMBM CTKGLGAQTGVLRMKG VNI*S CVRTC 

MWAEERP\RIALTQSFS KIiFKDLGLPARAVSTTFGCRVNVAIC 
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amino acid 
sequence 


Amino acid segment containing signal peptide 
(A~Alanine, C=Cysteine, D=Aspartic Acid, B- 
Olutaraic Acid, F^Phenylalanine, G^Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine , 
P=Proline, G>Glut amine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=»Tyrosine, X=»Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








LQGTAGPDPTTVYVDMRMiRHDRVRLVKRGSPHSLPLMESGKIL 
PGVKVT IAHTKTKGPLGDSHLGEI WVSS PHNATGYYTVYGK EAI» 
HADHPSARLS FGI7TQTIHARTGYLGFLRRTELTDASGGRHDALY 
WGSI J5KTLKLRGMRYHPIDIETS VIRAHRS lAECAVFTWTNLL 
VVVVELDGLBQDALDIiVALVTNVVLBEHYLVVGVVV P 
INSRGEKQRMHLRDGFLADQLDP I YVAYNM 


6807 


1444 


60$ 


VGfflXTVHAMFTC3?PKCtiGFSP PVN 

. SVFQAGPQLQAXjANLBARRGSIGAAI^SRDVSGLPVYAQSGBPR 
RLTQAQVAAFPGENALEHSSDQDT WDSLRS PGFCSPLS SGGGAE 
SLPPGGPGHARAGIILGKVCDPHLNHQQPSPTSVLPTEVAAPPIiB 
KILSVDSVAVDCAYRTVPKPGPQPGPHGSI^TKGCLRSI^GDI^ 
RFPCX3MEVHSGQRELESVVAVGEAMA\l,KFPMGAMSYCIiRDRSR 
FLFRLPMGLSCPLQVQ 


6808 


2063 


737 


GVGSGAASAIiARSRPLASRItSSRRRTRAPRSGAMORIiAMDLRML 
SRKLSLYLEHQVRVGFFGSGVGLSLILGFSVAYAPYYI.SS IAKK 
PQLVTGGES PSRFIjQDHCPVVTBTYYPTVHCPJE3GRGQTI»LRPP\ 
I TS KP PVQYRNELIKTADGGQI SIiDWFDNDNSTCYMDASTRPT I 
LLLPGLTGTSKESYILHMIHLSKEIjGYRC^ 

PRTYCCANTEDLETVII^[VI1SLYPSAPFI^GV3^^MI J I^NYL 
GKIGSKTPI^AATPSVGWNTFACSESLBKPLNWLLFNYYLT^ 
LQSSVNKHRHMFVKQVDMDHVMKAKS I REFDKRFTS VMTO 
DDYYTDAS PS PRLKSVG I FVLCLNS VDDVFS PSHAI P I ETAKQN 
PNVALVLTSYGGHIGFLEGIWPRQSTYMDRVFKQFVQAMVEHGH 
EI»S 


6809 


939 


65 


DYSGG^PVPTEHGMIXYTPAQTHPEQPGSEASTQPIAGTQTVPQ 
TDEaAO^SQPLHPSDPTEKQQPKRIJIV^IPFRFRDPDLRQMF 
GQFGKIIiDVEI IFNERGSKGPGFVTFETSSnADRAREKLNGTIV 
EGRKIFAWKATARV^r^NKKTGNPYTNGWKLNPVVGAVYGPEFYA 
VTGFPYPTTGTAVAYRGAHIjRGRGRAVYNTFRAAPPPPP I PTYG 
AVVYQDGFYGAE I \liBATQPTDTLS PLQRRQ PTATVTAESTQLP 
TRTITPSGPRRPTALEPCBTFHRFLLGP 


6810 


939 


65 


DYSGQTPVPTBHGMTLYTPAO/THPEQPGSEASTQP IAGTQTVPQ 

TDEAAO/rDSQPLHPSDPTEKQQPKRLHVSNI PFRFRDPDLRQMF 

GQFGKILDVBI IFNERGS KGFGFVT FBTSSDADRARBKLNGTI V 

EGRKtEVNNATARVMTNKKTGNPYTNGWK^ 

VTG FP YPTTGTAVAYRGAHIiRGRGRAVYNTFRAAP PPPP I PTYG 

AVVYQDGFYGAE I \IJSATQPTETTIjS PLQRRQPTATVTAESTQLP 

TRTITPSGPRRPTALBPCETFHRFLLGP 


*811 


1522 


658 


DLVTVWS FVDCR VIASTHGH \ KSWVS WAFDP YTTS VEEGDPME 
FSGSDEDFX3DIJ^HFGRDRADSTQC3^RRNSTDSRPV3VTYRFG 
SVGQDTQI^WDLTEDILFPHQPLSRARTHTimiNATSPPAGSN 
GNSVTTPGNSVPPPLPRSN5LPHSAVSNAGSKS SVMDGAIASGV 
SKFATI^UTORKERHHEKDHKRNHSMGHISSKSSDKIjNLVTKTK 

tdpaicixgtplcprwbdvpdiieplickkiaherltvlifle^ 
vtacqegfictwgrpgkvvsfnp 


6812 


4001 


16-82 

• 


RDAVFS LDLSTI IQGTWFLNGEBLKSNEPEGQVE PGALR YRIEQ 
KGLQHRLILHAVTGIQD3GAI.VGFSCPGVQDSAALTIQESPVHIL 
SPQDKVSLTFTTSERVVLTCELSRVDFPA'rWYKDGQKVEESEIaL 
WKNIX5R10mLlLPEAKVQDSGEFEaRTEGVSAFFGVTVQDPPV 
HI VDPREHVFVHAI TS KCVM I JVCEV \ DR \EDAP VR W Y KDGQE VE 

esdpvvlenegphrrlvlpatqpsixk;efocvagdecayftvti 

TDVSSWIVYPSGKVYVAAVRI^VVLTCFXC^ 

B WES PALLLQKEDTVRRLVLPAVQLEDSGEYLCE iddesas ft 

VTVTBPPTOimRDEVTIiIAVTI^ 

DGUBVBESBALVLERDGPRCRLVLPAAQPEDGGE FVCDAGDDSA 
FFTVTVTEPPVQFLAIiETTPS PLCVAPGEPWIiS CELSRAGAPV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A»Alanine, C=*Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
HaHistidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, W-Asparagine , 
P=Proline, Q^Glut amine, R^Arginine, 
S=Serine, T=Threonine, V= Valine, 
"tryptophan, Y-Tyroeine, X=Unknown, +=Stop 
Codon, /=posaible nucleotide deletion, 
\=possible nucleotide insertion) 








WSHNGRPVQEGEGLELHAKGPRRVIxCIQAAGPAHAGL^CQSG 

AAPGAPSLSFTVQVAEPPVRVVAPEAAQ'rRVRSTPGGDLELVVH 

I^GPGGPVRWYKDGBKtJ^Q^VQI^Q^^ 

GEYLCDAP^DSRIFIAfSVEBPLLVKLVS^^ 

CEVSPPDADVTWLRNGAVVTPGPQRQSCX^ 

CVSKWRO^WVQRGPCAGCEVGSPCPTTLACPWPRMGTSTASSS 

MVSYWPTRAPTAARATTIAPWPGSA 


6813 


9 


836 


SSTQQRPGVPAGPRPIJDGYIX5VADHKPLKMHCRDCALVTSSGHL 
LHS HQGSQ I DQTECVI RMNDAPTRG YGRDVGNRTSLRVT AHSS I 
QRILRNRHDLLNVS QGTVFI PWGPSS YMRRDG KGQVYNNLHLJ£»S 
QVI^RLKAFMITRHKMI^FDELFKQKTGQ\NRKXSNTWLSTGWP 
TMTIALELCDRINVYGMGPPDFCRDPNHPSVPYHYYKPFGPDBC 
TMYLSHERGRK03HHRFITEKRVFKNWARTFNIHFFQPDWKPES 
IA1NHPENKPVF 


6814 


3 


737 


KFRRQEAN/ ARKRNRMHGItNDALDinjRKWPC^ ET 
LRLAKWYIWALSEIIJQGlGtPDI^TPV^ 

CLQLNARS FLMGCGGEAAHHTRS P YSTPYP P YHS PELTTP PGHG 
TLmSKSMKPYNYCSAYESFYBSTSPECA£PQF^GPLSPPPrNY 
NGI PS LKG^ETI^YGKNYNYGMHYCAVPPRGPLGGX3AMFRI,P TD 
SHFPYDLHLRSQSriTMQDBIiNAVFHN 


6815 


906 


553 


QGIiDPASQTKVVKIiLKDGSGRRGDRRSSRDMAGGAGPRSESDLE 

DVGPTABWlK5IX5SGSLRRSGSFGKLIaT!AI^ 

PQEPPNPRMKPJVSSLNFLNKSVEEPTQPGG 


6816 


1 


803 


NLLKTHKP\Ll^DEDSI^VPVAOMGNYQBYLKTLASPIJiEID 
PDQPKRIJ1TFGNPFKQDKKGMMIDEADEPVAGPQMKVKRPGEPN 
SPMSSKRRRSMSLLLRKPQTPPTVTNHVGGKGPPSy\SWFPSYPN 
hi KPTLVHTDAT I IHDGHEEKMBNGQITPDGFLSKSAPSELINM 
TGDLMPPNQVMI^DDFTSI^KIX5LIQKP<5SNAFVGGAKNCSLS 
VDDQKDPVASTLGAMPNTIiQITPAMAQGlNADI KHQLMKEVRKF 
GR5K 


6817 


172 


3457 


IX»^SPKIGNGLPVIGPGTDIGISSr*^GYIX;KNFDSAKVPS 

DBYCPACKEKGKIJCAIJCrYRISFQESIFLCBDLQCIYPLGSKSL 

NNLISPDLEECHTPR1CPQKRKSLESSYKDSLLLANSKKTRNYIA 

IDGGKVLNSKHNGEVYDETSSNLPDSSGQQNP IRTADSLBRNE I 

LEADTVDMATi'lU>PATVDVSGTGRPSPG 

TSFPQALCVQWKNAYALCWIjira 

ES I FWUjLTTnTNQANTIXYTSQLSGVlCIX^CKKL,'rSE I FAE IET 
O^NEVRDEIFISI^PQI^CTU^ 

FLYSFSi^FECSQOGHQYQNRHMKSLVTFTNVI PEWHPLNAAH P 

GPCNN CNS KSQIRKMVLEKVSPIFMIiHFVBGLPQNDIjQHYAFHF 

EGCLYQITSVlQYRANKHFITWILnADGSWLBCDDIiKGPCSERH 

KKFEVPASEIHIVIV^KISQVTDKEAAO^LKKim>0HALSNE 

KPVSLTSCSVGDAASABTAS VTHPKDI SVAPRTLSQEXTAVTHGD 

HLLSGPKGLVDNILPLTLEETIQKTASVSQIJ^SEAFL\liKNKPV 

AEimSXLKTNTLO^QESrj^ASSVaAPCNEKLIQDQFVniSFPSQ 

VVNTNMQSVQ1OTEDTVOTKSVNNTDATGLI 

LHQFXTPKTEQIjaPERVTSQVSNLKKKETTADSQT^ 

SLKr^QKK^F^SWVKGLISRGASPMPLCVfiAHNRNTITDLQPS 

VKGVNim^GFXTKGINQKASm^KKARiCSASKPPPISKPPAGPP 

S SNGTAAHPHAHAAS EVLEKSGSTS CGAQLNHS S YGNG I S SANH 

EPLVEGQIJiKLRIiKLRKKLKAEKKKIiAAI^ 

QVPQDGS PNDCHS IEDLLNEI*P YPIDIANESACTTVPGVSLYSS 

QTHF, R 1 LAK LLS PT P VKTELS ENGEGD FR YLGMGDSI 1 I PPP VPS 

EFNDVSQNTHLRQDHNYCS P TKKNP CEVQ PDS LTNNACVRTIiNL 

ESPMKTDI FDEFFSSSALNALANDTLDLPHFDEYIjFENY 


6818 


2 


240 


RGFDKVLWT/I^GAVK\CVQFSRISPIX3EEGYPGELKVWVTYTL 
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SEQ 
ID 
NO-. 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C= Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Hi3tidine, I=Isoleucine, K« Lysine, 
I. ^Leucine, Methionine, N=^Asparagine , 
P=Proline, O^Glut amine, R=Arginine, 
S=Serine, Threonine, V^Valine, 
W^Tryptophan, Y^Tyrosine, X- unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\apossible nucleotide insertion) 








DGGE/ LHS / ATTEHKP / VQATPVNLT \TI LTSTWQARLPQI 


6819 


1 


961 


G IPCTBMGNFDRftNVTGEIEFAIHYCFKT^LEl CIKACKNIiAY 
GBEKKKKCNPyVKTYLI*PDRSSOGKRKTGVQRNTVDPTFQETLK 
YQVAPAQL VTRQLQVS VWHLGTIiARRVFLGKVX I PZaATWDFBDS 
TTQSFHWHPLRAKADKYEDSVPQSNGEIjTVRAKL^ 
EAOEGTDQPSLH^LCLVVLGAKMI^^ 

DQQKLRLKSPVLRKQACPQWKHS FVFSGVTPAQIiRQSSLELTVW 

DQALFGMNDRLLGGT\RLGSKGiyrAVG^ 

PNLWTDMTLVLH ' 


6B20 


1014 


340 


GDMWIVGHVPPGFFBKTQNKAWFREGFNEK^ 
GQFFGHHHTDS FRMLYDDAGVP I SAMFIXPGVTPWKTTLPGVVN 
GANNPAIRVFEYDRATLSLKDMVTYFM^ 

QLTEAYGVPDASAHSMHTVIiDR IAGDQ3 TLQRYYVYNSV3 YSAG 

VCDBACSMQHVQVMRQVDIDAYTTCliYASGTTPVPQL 

LGIiCT 


6821 


1088 


518 


EFDIYR/BVGGEFVPVTRDDSSNGFPRTQHGPSPTVHPIQSPQM 
RPCVT.TriDPETLPAIATTDIDVLFYSHSTPKEAASSSPBPSSIT 
FFAFSLIEGYI \SlVb(DAETQKKFPSDLLLTSSBGELWRMVRlG 
G0PI/3FDECG1 VAQ XAGPLAAAD I SAYYISTFNFTDHAIjVPEDGI 
GSVI EVLQRRQEGLA6 


$822 


1088 


518 


efdi yr/evggbfvpvtrddssngfprtohgps ptvhpiospqn 
rfcvltldpetiipaiattlidvlfyshstpkeaassspe pss it 
ffafsliegyi \sivmdaetqkkfpsdllltss3gelwrmvrig 
gqplgfdecgivaqiagplaaadisayyistfnfdhalvpedgi 

GSVIEVLQRRQEGLAS 


6823 


654 


221 


ppkllsrwarmghgdeivVlsdlnfpgllhlpwgpwrsvqtac 
glpqiibavlkllplekty^nsspaavmelvpsdkerglqtpvwte 

YBS I LRRAGCVRALAKI ER FE FYERAKKAFAVVATGETALYGNLi 
ILRKGVLAI^PIiL 


6824 


858 


104 


ALPKLGICAGI^VVI^3LPFIjLEWPSGYLSRS FDLGRQFLFHWTV 
iW RFLPEALFLffitAFHLALI/T^ X L3 
LLRDPSKRKVPPQPLTPNQIVSTLFTSNFIGICFSRSLHYQFYV 
WYFHTLPYLLWAMPARWLTHLLRIiLVIjGLIELSWNTYPSTSCSS 
AAI«ICHAVlLIiQLWLGPOPFPKSTQHSKKAH 


6825 


3 


1173 


SSGEFGLQASD XHWTISDTGff IIiI I LCSLMEPWAIiGACTFVHIjL 
PKFDPLVILKTLSSYPIKSMVKAPIVYRMU^DI^SYKFPHLQ 
NCIjAGGESLLPETLENWRAjQTGLDIRBFYGQTETG 
KIKPGYMGTAASCY0VQI IDDKGNVL PPGTEGD IGIRVKP I RPI 
G I FSG YVDNPDKTAAN I RGDFWLLGDRGIKDEDGYFQFMGRADD 
I INSSGYRIGPSEVENALMBHPAWETAVISSPDPVRGEWKAF 
VILALQFLSHDPEQLTKJBLOX2HVKSVTAPYKYPRK I EFVLNL PK 
TVTGK IQRA\ KLRDKEWKMSGKAPCAVRHLRDIHLDS PLLSLS F 

DPf^OT>2kT>'DMnr7vrn^QT.um7ui7Viri7r ,T T.&T wtctbt vutidp 
f r o rAiHJUrrlUU I vxJJoIjVyJSIuIIS I R-rL.J_iAJjV.Lo 1 IVlf X H VKC 


6826 


2304 


954 


LKTES FKPW/ VNIALAFHIJjGERASPNSF^ 

LYFEEDEVRYLQSTOAj^TOWSQYKjTTARQYAYTYK^ 

NKL*PLllBSFTYBI>YltWAVSSVMTROJH PLW 

DMCNHTNGL ITTGYNLEDDRCT I Y I FYGTRS N 

AEFVIHSGFFFDNNSHDRVIlTKLGVSR^D^^ 

PTSS^FALHFTEPPISAQLLAFl^W 

R I FTI^NSEFPVSWDKTCVKLWTFJ^ 

VLKNHDLS VRAKMAI WjRj^EKEII^IlAVKSAAVNP^YYRQQME 
EIO^J^KYEESNjUJIJjESSVGDSRLPLVLRj^ 
REAISKAKATENGLVNGENS I PNGl^B^ 
GSSSDSTAGVKE 
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SEQ 
ID 
130: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C-Cysteine, D^Anpartic Acid, E- 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H»Histidine, I^Isoleucine, K~Lysine, 
L-Leucine, M=Methionine, N^Asparagine, 
P~Proline, Q-Glut amine, R-Arginine , 
S=serine, T=Threonine, v=»Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6827 


1 


779 


SSVVEFGI^VIXGLFLLFVLENMIiGLLRHRGLRPRCCRRKRRNL 

ETRNLDPENOSGMALQPLQAAPEPGAQG0REKNSQHPPALAPPG 

HQGHSHGHQGGTDITWMVLI^DGIjHNLTIW 

LSTTIAVFQJELPHELGDFAMLLG^GLSFRR^ 

GGAVLGVGLSICPVPLTPWVFGVTAGVPLYVAJ^V^ 

GAPAxA\HVIiLQGI^I^J«^CLMLAITIJ,EB^ 


6828 


3 


1654 


KSQHO / WI LQLMH S CKEG WKDI*KGNI^IiHRAMLDl»DNGTRPS K 
LGHLSQTASLKRGS S FQSGRDDTWRYTCTPHRVAFVRKI*TKLVI»S 
QLPNFWKLWISYVNGSLFSETABKSGQIERSKNVRQRQNDPKKM 
IQBVMHSLVKLTOGALI^I^ IRIXSBAKQYGGWKVKOTLSGQWIiA 
HAIQTVRI/THR SI/TALE IPNDLLO/TIQDLIIJ)IJlVRCVMAXLQH 
TAEEIKRIAEKEDWIVDNKGLTSLPCQFRQCIVCSLQSLiCGVI>B 
CKPGEASVPOX>PKTQBEVCCLSINIMOyPIYCLEQI>STKPDA^ 
DTTHLSVDVSS PDLFGS IHEDPSLTSEQRI1LIVI1SNCCYI1BRHT 
FLN I ABHFEKHNFQG I EKITQV SMASLKBLDQRIiFKNY I ELKAD 
P I VGSLEPG I YAG YFDWKDCIiPPTGVRNYLKBALVNI IAVHAEV 
FTISKELVPR\n^KVIKAVSERLSRLt^CVSSPSKNGALO^RLK 
I CALRDIVAVYLTPESKSSFK!OAI<EAIf 
KFKSSMHLQLTCFQAASSTMMKT 


6829 


1 


782 ; 


MRMEAGEAAPPAGAGGRAW3GWGKWVTU^ 

EQKSFI^RLCO^EELQSDRDETC^YblDRDPTYFGPII^FIJlHG 
KLVLDKDMAKEGVLEEAEFYNIGPLIRIIKDRMBEKDYTVTQVP 
PKHVYRVI^CQBEBLTQMVSTMSDGWRFKQLVNIGSSYNYGSED 
OAEFLCWSKBLHSTPNGLiSSESSRKTKSTEEQLEEQQQQEEEV 
BBVEVEOVQVEADAQEK/ CCYKPBAPGCEAPDHbQGtiGVPI 


6830 


1 


939 


MKPGS VKNLS IVYRSRDFLVVNKHWDVRIDSKAWRETLTLQKQL 
RYRFPELADPDTCYGFRFCHQLDPSTSGALCVAIiNKAAAGSAYR 
CFKERRVTKAYLALIiRGHIQESRVT I SHAIGRNS TEGRAHTMCI 
EGSQGCENP KPS LTDL WLEI IGLYAGDPVS KVLJjKPLTGRTHQL 
RV\HCSALGHPVVaDLTYGEVSGREDRPFRMMLHAFYLRIPTDT 
ECVEVCTPDPFLPSIiDACWSPHTLI^ 

DRGPRPGSPSAIiLPGPGRPPPPPTKPPETEAQRGPCLQWLSEWT 
LBPDS 


6831 


3 


1087 


SI^TGSSTPDNKVAEQEDLETQPSPSVEKAV'XVXDPBGTIP'I'NP 
NVAKKPAI^LS5VKLKTADBPRGTLVKSG1XK2NVKEESMILSN 
VEDLQQPKF I SEVSREDYGKKEI SGDSEEMNINS VVTSADGENIj 
EIQSYSLIGBKLVMEBAKTIVPPHVTIJSKRVQKPAIAPPSKWNI 
SIFKEEPR3DQKQKSLLSFDWDKVPQQPKSASSNFASKNITKE 
SEKPES 1 1LP VEESKGSLIDFSEDRIiKKEMQNPTSLKISEEBTK 
LRS VS PTEKKDNIjENR\ S YTL\AEKKVLAEKQNS V\APIfEI>RDS 
NKIGKTQITIXSSRSTELKESKAE^POHFyQNEDYNERPKI IVG 


6832 


1809 


412 


MGSGLISGPPQDNSGEALKEPERAQEHSLPNFAGGQIIFFEYLLV 

VSLKKKRSEDDYEPI ITYQFPKRENIjIiRGQQEEEBRIjLKAIPLF 

CFPDGWEWASIj i K x PRE X tT SFVIiTH VlXiSRKIUI CJiKJJjFiVGPG 

PRLPKVYCI ISCIGCFGLFSKItDBVEKRHQlSMAVI YPFMQGE. 

REAAFPAPGKTVTLKSFIPDSGTKFISLTRPLDSHLEHVDFSSL 

LHCLSFEQILQ I PASAVLER KI I FLAEGLSTLSQ CIHAAAALL Y 

PFSWAHTyiPWPESIiLATVCX:PTPFMVGVO>1RFQQEVMDSPMH 

EVLLVNLCF^TFLMSVGDEKDILPPKIXJDDIIjDSLGG^INELKT 

ABQINBHVSGPFVQFFVKrVGHYASYIKREANGO<3HFQER3FCK 

ALTSKTNRJRFVKKFVK^LFSLFIQEA^ 

YBEQKKQ/TETKGKNCEIRAWWKND 


6833 


1 


1129 


PLMTI>SQCGGIPGHGHSHGGHGHGHGLPKGPRVKSTRPGSSDIN 

VAPGEOGPDGJ2ETNTL\mOTSNSNGM 

QVNGNLVREPDHMEI^EDRAGQLNMRGVFI#^ 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 

to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alaniue, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine , G^Glycine, 
H=Histidine, I-Isoleucine, K-Lysine, 
L— Leucine, M=Methionine, N=Asparagine, 
psProline, Q=Glutamine, R=Arginine, 
S =S erine , T=Thr eonine , V=»Val ine , 
W=Tryptophan, Y=Tyrosine, X=*Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








HALVPYFSWKGCSEGDFCVNPCFPDPCKAFVKIINSTHASVYEA 

GPCWVLYLDPTLCVVMVCILLyTTYPL^KBSALIIiQTVPTO 

IRNIiIKELRNVEGVKKVHBLHVWQLAGSRI IATAHIKCEDPTSY 

MKVAKTIKDVFHNHGXHATTIQPEFASVGSKSSVVPCELACRTQ 

CALKQCCGTLPQAPSGKDAEiCrPAVSISCLELSNNLBKKPRRTK 

ABNIPA\WIEIKN\IPNK\QPESSL 


6834 


78 


1151 


AGQERPAPIWRLLWLPTPSVSRKABPAHIPINR*GA*E*RGGLP 
IXX5SSASAYGWH*M»TPHSPGGS*HM*SSKAPVTQARBVLV7«3P 
CSKLVLSGARGIVGTTVQ\^VEAQQPIJ»I^ 
SRAL*LIB£VTQYttUlARXGNAVVG<3V^ 

•» nironnrcilY Tnro/ , r*vr^B17U^VDT.*\l1>n^JV*UT?HA^2<^VRPf3rt 
AAAAVKiJv^KJsVij 1 VovrlJJ^yyfVlSVl> Vr<JU w VKliVv*viiJl/vc»v,VE»rV7U 

AHGUPGIJU>AKGRGGTNBVKEQVQVDGVQKI.VI*SAIffiClIELVAG 
QQDGEDQAARTRLIiQAGAHSVAHGRRQGQAP CRPHQEAGVS CHE 
LQQWGDAIi *ARB*APQI IVLIiI»I>EDVAQLRTGKKA*DBVVDVE 
QLLRQIi 


6835 


1 


834 


GIPAADRNEASI^LIKLDISRTFPN^IFQC^GPxHDm^ILG 

AYTCYRPDVGYVQGMSFIAAVLIUJLDTluT^ 

MAFFRVDHGIjMLTYF AftJr JS V r J?ivKNJ^Jrvi>r/ulr KAnnL x rux x jj 

IDWIFTLYSKSLPIJXI^CRIWDVFCRDGE^ 

DILTKMDFIHMAQFIiTRIiPEDIjPABEILFAS iatiqmqsrnkkwa 

QVLTALQKDSRKMRBGKSWPTLRIXJREFA^ 

FRIiT PGQPKRTDAIi 


6836 


l 


850 


mscgrpppdVdgmitlkv\dnltyrtspdslrrvfkkygrvgdv 
yi prep,htkaprgfafvrfhdrrdaq1^aamdgaeijx3relrv 

QVARYGRRDLPRSRQGRRHAAGPBAA/RYGRRSRSYGRRSRSPR 
RRHRSRSRGPSCSRSRSRSRYRGSRY5RSPYSRSPY3RSRYSRS 
PYSRSRYRESRYGGSHYSSSGYSNSRYSRYHSSRSHSKSGSSTS 
SRSASTSKSSSARRSKSSSVSRSRSRSRSSSMTRSPPRVSKRKS 


6837 


1 


1369 


TDGAAVAGNPGSDYFPGGTAP/G6PRTRRP\SGTSSSGSKASGP 
PNPPAQGIXrrSLSPNYTLKSTSGNDGKPVSGGGGRGRGRRKRDS 
GHVSPGTFFDKYSAAPDSGGAPGVSPGQQ0ASGAAVGG6SAGBT 
RGAPTPHBKALTSPSWGKGAKLLLGDQPDLIGSLDGGAKSDSSS 
PNVGB FASDEVSTSYANEDEVSSSSDNPQALVKASRS PLVTGS P 
KLPPRGVGAGEHGPKAPPPAUGLGIMSNSTSTPDSYGGGGGPGH 
PGTPGLEQVRTPTSSSGAPP PDE IHPIiEILQAQ I QLQRQQFS I S 
I^PIjGIJ<GGKKGECAVGASGAQNGDSBLGSCCSEAVKSAMSTI 

dldswiaehsaavraipadkalvdsadddktiapw 

keahdi^ankasasqpgshiocxsvhctddvgdakart^vptwr 

slhsdisnrfgtfvaalt 


6838 


16 


499 


LTDTPPPK1HMIHHS ISDYIQVTIjRCWALGFYPME ITLTWQQDEB 
DQTRDMELVETRPAGDGTFQKWAAVVVPSGKB/Q/RYMCHVQHE 
GLPEPI»TI»RVIBQSSQPTIPIVGlVAGLVLtfiAVVTGAWSAVMC 
RKKNSDRVSYSEAflSSDHAQGSDVSIiTACKV 


6B39 


1 


1195 


AAPAGGGPDPKAI^AFPGRHLSGLSWPQVKRIJQALLSEPIPIHG 
RGNFPTLSVQPRQIRAG?GPQHPGGAG\ IHVHRVRLHGSAASHVL 
HPESGLG YXDLDLVFRMDLRSE/VS FQtTKAVVLACL^FLPAGV 
SRAKITPLTIiKEAWQKLVKVCTDSDRWSIjI SI^NKSGKNVBIiK 
FVDSVRRQFE FS IDS FQ 1 1 LDSIXI*FGQCSSTPMS EAFHPTVTG 
ESLYGDFTEALEHLRHRVIATRSPKEIRGGGLIiKYCHI»LVRGFR 
PRPSTDVRALQRYMCSRFFIDFPDLVEQRRTLERYLEAHFGGAD 
AARR YACLVTLHRWNESTVC3IJ^nu^RRQTLDL I AALALQALAE 
QGPAATAArj^PPGTDGVVPATVNYYVTPVQPLIAHAYPTWLP 

Of 


6840 


4254 


2061 


ELQGDF3VPDVPKSMAWCENS ICVGFKRDYYL IRVDGKGS I KKL 
FPTGKQIjEPLVAPIiAIX5KVAVGQDDLTVVI2*IEEG I CTQKCALNW 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(JWManine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F- Phenylalanine, G=Glycine, 
H^Histidine, I=Isoleucine, K= Lysine, 
L-Leucine, M=Methionine, N^Asparagine , 
P=Prol ine , Q^Glut amine , Rs=Arginine , 
S=$erine, T«Threonine, \Mteline, 
W=Tryptophan, Y^Tyrosine, X-Unknown, *=3top 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








TDIPVAMEHQPPYIXMrtiPRYVEIRTFEPRLLVQSIELQRPRPl 
TSGGSNI I YVASNHFVWRLI PVPM^TQIQQLLQDKQFbLAIxQLA 
EMKDDSDS E KQQQ Hffl I KNIjYAPNLFCX>KRFDESMQVFAKLGTD 
PTHVMGLYPDIiIiPTDYRKQI^YPNPLPVLSQAEI^KAHLAIjIDY 
LT0KRSQLVKKM4DSDHQSSTSPLMBGTPTI KSKKKLLQ1IDTT 
LLKC5fLHTNVALVAPLI»RIjBNNHCHI I 
LYBKXGIiHEKAl^JVLVDQSKKANSPLK^ 

HLI FS YSVWVIiRDFPEDGLKI FTKDLPEVESLPRDRVLGFLIEN 
FXGLAI PYIjEHI IHVHKETGSRFHNCLIQLYCEKVQGLMKKYLL 
S FPAGKTPVPAGEEEGELGEYRQKLXMFLE I S$ YYDPGRL I CDF 
PFDGIJ^BRALLLGRMGKHEQALPIYVHILIOJTRMABBYCT 
DRNKDGNKDVYLSLLRMYLSPPS IHCIjGP I KLELLEPKANLQAA 
LQVLELHHSKLDTTKAIiNL^ 

RFNQVLKNIiLHAEFLRVXQBBRlLHQQfVKC I ITEEKVCMVCKKK 
IGN3AFARYPNGVWHYFCS\KEVNPADT 


" 6841 


1 


3206 


TPSTTCTKBNTPTSSVPSAAVTPI^SLQPtGDYGVGSKNSKRA " 

RBKRDSRNMEVQVTQKMRNVSIGKGSSDBWSDVQD1 IDSTPELD 

MCPETRLDRTGSSPTQGrVNKAFGIlTnDSLYHEL^ 

VDEGADLLGEFSGMGKBVGNIjLIiEHSQIjIjBT^ 

KVDQI^GBQEVLROELEAAKQAKVKIjENRIKELEEBIjKRVKSEA 

1 1ARRE PKERAEDVSS YLCTESDKI PMAQRRRFTRVEMARVLMB 

RNQYKERLMELQEAVRWTEMIRASREIIPSVQEKKKST1WQFFSR 

LFSSSSSPPPAKKPYPSGMIHYKSPTTAGFSQRRNHAMCPISAG 

SRPLEFFPDDDCTSSARREQKRBQYRQVREHVRWDDC3R1QACGW 

SL PAKYKQLS PNGGQEDTRMKNVPVpVYCRPLVEKDPTMKLWCA 

AGVtn^GWRPNEDDAGNGVKPAPGRDPLTCDREGDGEPKSAHTS 

PEKKKAKELPEMDATSSRVWILTSTLTTSKVVI IDANQPGTWD 

QFTVC^AIIVLCI33IPAA3I^DYPP<3EMFLBSDVKPEDPGADGV 

rJU3lTLVGCATRCOTPR5KCSSRGDTTVIiDKGOG 

NPSQSTEEATEArBVPDPGPSBPETATLRPGPLTEHVFTDPAPT 

PSSGPQPGSBITCPEPDSSSTRPBPEPSGDPTGAGSSAAPTMWLG 

AQNGWLYVHSAVANWKKCLHS IKLKDSVLSLVHVKGRVLVALAD 

GTCAI FHRGBIX3QWDLSNYHLMDLGH PHHS IRCMAVVYDR VWCG 

YKNKVHVTQPKTMQIBKSFBAHPRRESQVRQL^ 

IJLSTT^LYHAHTHQHlvaDVDrBPYVSKMIX^ 

LVAGSRLWVGTG^WttSIPLTBTVVIJ^^ 

TSGEG \ARPGG\ I IBOTG\DDSSDRAARSFIPYCSMAQAQI^FH 

GHRl^VKITVSVPGNVIJVTLNGSVI^SPABGPGPAAPASEVF^ 

KLiRNVLVLSGGEGYI DFR I GDGKDDETEEGAGDMSQ VKPVLS KA 

ERSHI I VWQVS YTPE 


6B42 


3 


926 


RCOXJI^ATILTDHQYIiERTPIXMLKQKAPOXJYRlRAKLRSYKP 
RRLFQSVKLHCPKCHLLQEVPHEGDLDI IFQDGATKTPDVKLQN 
TSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECI1I1L T EGGT 
LSBICIOiSNKFNSVIPVRSGHEDLEIJ^LSAPFLlOXSTVHK^ 
KQWST* RS IQNl^SLVDKTSWI PSSVAEALGI VPLQYVFVMTFT 
LDDGTGVLBAYLMDSDKFFQI PASEVLMDDDLQKSVDMIMDMFC 
P PGI KI DAYWI^CFI KS YKVTNGTDNQ I CYQ I FDTTVABDVI 


6843 


2 


851 


NHRKVLSGAKRYKCNEOGKSFAYTSSLI KHRRIHTGERPYBCSE 
CGR^PAENSSIiIKHLRVHTGBRPYECVECSRSFRRSSSLLQHQR 
VHTRERP YECSECGKSFSLRSNLIHHORVHTGERHE CGQCGKS F 
SRKSSLI IHLRVHTGER^YECSDC^PCSFAEl^SLIKHLRVHTGE 
RPYECIDCGKSFRES SSFRRHQRVHTGMRPYK*SKFWKFSCPGF 
LLLQGQRVHTGSRCYECDKWG I FFS*HASFFT* KSAPTEEVPPE 
CNECEKAPSPIiSLVTTlFT 


6844 


244 


642 


EHQIAGFELRKTQTSMSLGTTREKTDRV^ 

QYDVKSEIYSFGIVLWEIATGDIPFQGCNSEKrRKLVAVKRQQK 
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amino acid 
sequence 


Amino acid secrment ^nnt-Ainina « •» <-rr>» 1 -i h - 

■*-*•«• Bb^iiicuu Luatflumig ii LyiitlX pep title 

{A=- Alanine, C^Cysteine, D=Aspartic Acid, B«= 
Glutamic Acid, F= Phenylalanine , G=Glycine # 
H^Histidine, I-Isoleucine, K=Lysine. 
L=I»eucine» M=Methionine, N=Asparagine , 
PaProline, Q=Glutamine, R=^Arginine, 
S ^Serine, T=*Threonine, V«Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *e=stop 
Codon, /^possible nucleotide deletion, 
\»po3sible nucleotide insertion) 








PLGEDCPSBLREIIDECRAHDPSVRPSVDBII*KKI*STFSK*CIK ' 
I 


6845 


3 


1519 


VAV WKH VX Wi^JL/I^MJjJUf IIjMCIIPETA 

C^LKNA^I^YQGAKFAWESADSGLBVCPEDIYGVQEVHVNGAV 
GIAFELYYRTTQDLQLFREAGGWDVVRAVABFWCSRVEWSPRBB 
KYHLRGVMSPDBYHSGYNNSVYT^^ 

IPSQmiAVADKrKVPFDVEQNPHPKFDGYBPGEVVKQADVVI^ 
YP VPPSLS PDVRRKNUS I YEAVTS PQGPAMTWSMFAVGWMELKD 
AVR ARGLLDRS FANMAK PFKVWTKNAIX?SGAVNPlilt3MGGFT^QA 
VVFGCTGFRVTRAGVTPDPVCIiSGISRVSVSGIFYQGinCCiNFSF 
SBDSVTVBVTARAGPWAPHLEAKLWPSQSRLSIjIJXIHKVS fprs 
AGRXQMSPPKLPGSSSSEFPGRTFSDVRDPLQSPLWVTLGSSSP 
TESLTVDPASB^SGTGASBTSUSPSLWPRLHPPLLG'riiLACHPS 
PAARIiSGKVHAAWPBFKAFCIi 


6846 


213 


1258 


IiYFLKTIK* LWRLAEHP * YENEKbTKLRNTIMEQYTRTKESARG 
1 1 FTJCTRQSAYALSQW I TENEK FAE VG VKAHRL IGAGHSSEFKP 
MTQNEQKKVISKFRTGKINIiLIATTVAEBGIiDIKECWrvnCRYGL 

vtoeiamvoargraradestyvlvahsgsgvi ehetvndfrekm 

MYKAIHCVQNMKPEE YAHKI LELQMQS IMBKKMKTKRNIAKHYK 
NNPSLlTPliCKNCSVIiACSGEl)IJ{VIEKMHHVNMTPEFKELYIV 
RENKTLQKKCADYQINGB1 ICKCGQAWGTMMVHKGIiDLPCIiKIR 

nfvwfknnstkkqykkwvelp itfpnldysecclfsded 


6847 *~ 


1450 


348 


SMC WNSDRLEMPH DIJUjILYPPS Y VPYTGHLSDDSLSRKYCL'i? "* 
WFEDALNGVL* RABAIQPHCVNAGDRMEKFRQKYWNKLQTItRQQ 
PFAYGTLTVRSIiDTREHCLNEFNFPDPYSKVKQRKNGVALRCF 
PGVVRSlJlM^WEERQI»ALVKGIiLAG3tIVFDWGAKAV^ 
YFGFEEAKRKLQERPWIjVDSYSEHLQRIiKQPPHKCAI.1 FADNSG 
IDI I LGVFPFVREX^J^GTEVlIACNSGPALNDVTHSESLIVAE 
RIAGMDPVVHSALRT^HTiT.T.VQ^SSSPCIiDLSRLDKGIJUy^W 
ERGAJ>I*WIBGMGRAVHTNYHAAIiRCESLKtiAVI KNAWLAERLG 
GRLFS VI FKYB VPAE 


684B 


19 


16 


QMUUM^T TV1TDWTUT CMDVVDMTT or iymY VfT ncnrr un'-Kn'min " 
MOT WWW la JjJLXjp XJtQi J. V J^Wi^KK.KJN 1 Lth JLiAMJjKSI^SDI LHDADSND 

LKVI IISAEGPVFSSGHDLKELTBEQGRDYHABVFQTCSKVMMH 
IRNHPVPVIAMVNGLATAAGCQLVASCDIAVASDKSSFATPGVN 
VGLFCS rPGVALARAVPRKVALF»MLFTCBPISAQEAI^^ 
VVPEABLOBBTMRTARKIASI^RPWStGKATFYKQI^ 
YYLTSQAMVDNIiAIJ*lXX5KGITAFlX)KRKPVWSHEPV^VEH 


6849 


70 


821 


SLGVIX^CTiP^SpAPRPOTDTSP* P VGNWATQQEDLYHQS YEC 
VCVLFASVPDFKEFYSESNINHEGLECIiRLLNE I IADFDELLSK 
PKFSGVBKIKTIGSTYMAATGI^TSG^DftQQDABRSCSHLGTM 
VEFAVALGSKI*DVINKHS FNNFRLRVGLNIKJPVVAGVIGAQKPQ 
YDIWGNTVIWASRMBSTGVLGKIQVTBJSTA^ 
VIKVKGKGQLCTYFLNTDIiTRXGPPSATLG 


6850 


2 


1235 


ARGIjraEWTFFKLRQHISRNAQDKQEIjHIJMI£GVPnA\m3I.TD 

LDVbKI^iPFAKlPAKlSQ>m^EI>HLCHCPAXVEQTAFSFL 

imHLROLHVKFTDVAEIPAWVYlJJCNL 

Gt J?S LRELRHLKILHVKSNLTKVPSNI TO 

KIXVLfiSIJOttlMNVABLBIX^ 

NNIRTlF£IISFQHI,KRLTCIiKr>HHNKIVTIPPSITHVKNLESL 

YFSNNKLESLPVAVFS LQKIaRCIjDVS YNN I S MI P IE IGLLQNLQ 

HLHITCNKVDILPKQLFKCIKI^TLNLGQNCITSLPEKVGQLSQ 

LTQLELKGNCIJ!)RI,PAQIJC;QCRM^ 

EALNQDINIPFANGI 


6851 


1765 


660 


VSAQVSAREQENCLGWNLADS SQESYKSLEEARDCYPPSLLTLD 
I^LFNOVEQGPIiI^CPKAGTDLSHGRAREVGWMAAGtWIG^ 
CYCVY KLT IGRDDS EKtiEEEGEE EWDDDQELDKEE PD 1 W FD FET 
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beginning 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, OCysteine, D=Aspartic Acid, B= 
Glutamic Acid, F«Pbenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=£ysine, 
L=Leucine, M=Methionine, N=Asparagine . 
P=Proline, Q^Glu t amine , R^Arginine, 
S-Serine, T=Threonine, VWValine, 
W=Tryptophan, Y=Tyrosine, X^Unknown , *=Stop 
Codon, /-possible nucleotide deletion, 
^possible nucleotide insertion) 








MARPVrrKKSDWTEPGAPGGTEDRPSGGGKANRAHPIKQRPFPYE 
HKNTWSAQNCKNGS CVLDLSKCL?I QGKLL FAK PKDAG FP FSQD 
INSHIiASLSMAROTSPTPDPTVRKALCAFEmJJASIESQCK}!™ 
YINEVCRKTVBRCQfS FE^X^AGLNLLrSMTVINNMLAKSASDLK 
FPLISEGSGCAKVQVLKPIJ^LSEKPVLAGFXVGAQMLFSFMSL 
FIRNGNRBI LLETPAP 


6852 


1 


407 


RTRGEETYANFIKHNDGKNI FYAARTPATLFAVMFAMYI ISOLT 
GFI GUNS IAVLCNLVMGLAIjI FLCTWAYVKYSGK PRB IGTVTDQ 
IAETIjlffiQVI»KPIjGDBn^EBNIRQSVTNSIKAGLTD 
KTD 


6853 


3 


469 


GDS CAVCIELYKPNDL VR ILTC2JHI FHKTCVDPWJULEHR TCPMC 
KCD I LKAXG I EVD VEDGS VSLQVPVSNE IFWS ASSHBEDNRSET 
ASSGyASVQGTYEPPI^EHVOSTNEELQLVNHKAMSVAVDVIPH 
VDNPTFEKD ETPNQETAVRBI KS 


6854 


1148 


585 


HESYlGTFDPGFXCVCAAIQWLQDNSASxFLNRKIiVYEPSTQAK 
PVKMTFIjRMWIYSHHiyQQDLRKKI bDVGKRI»DVTGFCMTGKPG 
I ICVEGFKEHCEEFWHTIRYPKWKHISCKKAESVETEGNGRDLR 
I»FHSFBELLLEAHGDYGI*R^ 
LFGIESKSSDS 


6855 


1913 


1148 


GRVGGRVGR I CSPLSGANEY IASTDTIjKTEEVI^FTDQTDDIiAK 
EEPTSl^RBSETKGESGLVTiEGDKEIHQlFEDLDKXIAIJ^ 
YIPEGC^QRWAAEMVVAtDAriHREGIVCRDLNPNNILLNDRGHI 
QliTYFSRWSEVEDSCDSnAI ERMYCAPEVGAI TEETEACDWWSL 
GAVLFEIXTGKTLVECHTASIimOTLNMPEWVSEEARSLIQQL 
LQFNPLERLG AGVAGVED I KSHPFFT PVDW AELMR 


£856 


1617 


' 997 


VTOLYVSVDASTKDSl^rDRPLFKDFWQQFIJ)SLKAIiAVKQQR 
TVYR IiTLVKAWNVDEIjQAYAQLVSIiGNPDFI EVKGVTY CGES SA 
SSLTMAHVPWHEEVVQFWELVDLIPEYEIACEHEHSNCLLIAH 
RKFKIGGEWVmJINYlTOFQELIQEYEDSGGSKTFSAKDYMARTP 
HWALFGASBRGFDPKOTRHQRKNKSKAISGC 


6857 


1 


617 


KGPEATAMVCVCSHPNCRQNHIKPSHSAAQTWCGS PTPASAPNH 
RimMEQGICTLPSATEDAKEEGLBAQISRLABLlGRLBSKALWF 
DLOX)Rl^EEDGTNMHIiQLVRQFJ4AVCPEQl»S E FZjDSLRQ YLRGT 
TGVRNCFH I TAVRLSIX3FTFVr!in3FWBTBEAWKRHIjQ3 PLCXAF 
RHVKVDTLSQPEAIiSlUIiVPAAWCTWBRD 


6858 


2 


669 


RSRGIKDFENDPPIiSSCGI FQSRIAGDALM5SGIRISSVFASPA 

IiRCVQTAKLILBELKLEKKrKIRVEPGIFEWTKWEAGRTTPTLM 

SLFJ^KRANFNIDTD YRPAFPLSALM PAES YQEYMDRCTASMVQ 

rVNTCPQIXK3VTLlVSHGSTIJ)SCTRPLLGLPPRECGDFAQLVR 

KIPSLGHCFCBB»KEBGKirai»VNPPVKTLTHGANAAFl^ 

GN 








6859 


1 


1150 


GETMFKKAKTKAKKKPRKRSOSSGGYNLSDI IQSPSSTGLLKSG 
KTINSVBSI^BLLTSDSEGS YAGVGSPRDLQSPDFTTGFHSDKI S 
AKV KFYVNGT3 P VYSREDIiKPWEKS V X bKISAPQP XPSNRIDTT 
SSASWVAGSFSPVSPPWDLRTTMEIEESRQKCGATPKSHLGKT 
VSHGVKI^KQRKMIALTTKENNSGMNSHFJVnFTPSKAPKPVN 
AWASSIJISVSSKSFRDFIiBEICKSVTSHSSGDB^KKVSFKGIEll 
SQAPKI VRCSTHQTPGPEGNHI SDLPLLDSPNPWLSSSVTAPSM 
VAPVTFASIVEEEbQQEAALIRSREKPLAIjIQIEEHAIODLLVF 
YEAFGNPEBFVIVERTPQGPLAVPMWNKHGC 


6860 


1889 


1515 


DKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQI.AQ 
DTCLTI LQVKNWF IHARRI XVQPMIDQSNRAVSQGAAYSPBGQP 
MGS PVLDGQQHMG I RPAG PMSGMGMNMGHDGQWHYM 


6861 


1889 


1515 


DKDKKRQKKRGIFPKVATNIMRAWIiFQBLTHPYPSBEQKKQLAQ 
DTGLT1 LQVNNWFINARRI rVQPMIDQSNRAVSOGAAYSPEGQP 
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NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine„ C-Cysteine, D=Aspartic Acid, Ex 
Glutamic Acid, F=Phenylalanine, G«=Glycine, 
H=Histidine , Iajsoleucine , K=I»ysine , 
LwLeucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R=Arginine, 
S=*Serine, T=Threonine, V=Valine, 
W=Tryptophan» Y=Tyrosine, X-Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








MGS bVLDGQQHMGIRPAGPMSGMCSMNMGMDGQWHYM 


6862 


2 


471 


EE I DRBFHNKLKLKEDKLEKQEKP VNGEDKGDSGVDTQNSEGNA 
DEEDPLGPNCYYDKTKSFFDNISCDDNREHRPl^AEEP^ 
FGIPLRPNRGRGGYRGRGGLGFRGGRGRGt^RGGTPTAPRGFRG 
GFRGGRGGREFADPBYRKTTAPGP 


6863 


2216 


487 


PQ E PALKSEPSQVASNTI PLPL PQPNTO0DNGPCKQVCSTVGGS 
AICSCFPGYAIMAIXWSCEDQDECIiMGAHDCSRRQFCVNTDGSF 
y CVNHXVLCADGY ILNAHRKCVD I ^CVTDLHTCSRGKHCVNTIi 
GS FHCYKALTCEPGYALKIX?ECEDVDECA^ 
KGSFYC0ARQRCMIX5FLQDPEGNCVDINBCTSI»SKPCRPGFSCI 
NTVGSYTCQRNPIjICARGYHASDIXnT^ 

QVCHNLPGSYRCT>CKAGF^PJ3AFGRGCn3VNECNASPGRLCQHT 
CENTI/3SYRCSCASGFTjLAA1XIKRC 

GSYQOfCRQGYQI^EK5HTCTDIDBC^QGAGILCrFRCLNV^ 
YQCACPEQGYTmTANGRSCKDVDBCALG^mNCSEAETCHNIQGS 
FRCLRFECP PNYVQ VS KTKCBRTTCHDFLECQNS PARITHYQLN 
FQTGIiLVPAHI FRIGPAPAFTGDT IALNT IKGNEEGY FGTRRLN 
AYTX5VVYLQPJWL£PRnFALDVET^ FFT 
TFAL 


6664 


2 


2933 


LADSSPSWLQI IIKELLSMHHQPDPALrKEFDYLPPVDSRSSSG 
FVGLRNGGATCYMNAVFQQLYMQPG LPBSLLS VDDDTDNPDDSV 
FYQVQSLFGHLMES KLQYYVPENFWKI FKMWNKE LYVRBQQDAY 
E PFTSLiIDOJ^roEYliKX^RDQI FKJNT FQGIYS DQKI CKDCPHRY 
ERBEAFMALNLGVTSCO^^EISLIXiFVRGBVLEGSNAYYCEKCK 
BKRITVKRTCIKSIiPSVLVIHLMRFGFDWESGRS IKYDEQIRFP 
WMLNME P YTVSGMARQDS S S E VGENGRS VDQGGGGS PRKKVALT 
BN YE IiVGV T VHSGQAHAGHTYS PI KDRRGCGKGKWYKF1TOTVX E 
EFDLNDETLEYECFGGB^PKVYDG^NPYTDVRRRYWNAYMLPY 
QRVSDQNSPVLPKKSRVSWRQEABDLSLSAPSSPEISPQSSPR 
PimPNNDRI^ILTKLVKKaEKKGLPVEKMPARIYQMVRDKNLKF 
MKNRDVYSSDYFSFVI^LASIiNATKLKH^ 

FLPQTYl^TKKKIiRVDTEEW IATIEALiLS KS FDACQWIiVEYF I S 
SEGRELIKIFIJ,ECNVRBVRVAVATILEKTLDSAT,FYQDKLKSL 
HQI*I»EVLLALLDKDVPBNC3^ 

LRHSAlARHMISFbU^R^NQIRRWSSAC^EFGNLHNTVAIi 

VLHSDVSS<JRNVAPG1FKQRPPISIAPSSPLLPLHEBVBAIJ»FM 

SEGKPYI^VMFAIJlBliTGSIJ^IEMVVYCCT 

FIKNQLETAPPHELKNTFQLLHEXLVIEDPIQVBRVKFVFBTKN 

GIiIiALMHHSNHVDSSRCTO^TKPLVTLAQK^ 

WSWAVQWLQKKMSEHYWTLQSNVSl^TSTGia'FORTISAQDTLA 

YATALLNEKEQSGS SNGSESS PANENGDKHIiQQGSES PMM I GEL 

RSDLDDVDP 


6865 


1820 


1242 


DPBRWKPO^KVTPPGSSVSTTPVQVVRLQSPQSQGSMMPSCNRS 
CS CSRG PSVEIX5KWYGVRS YLHLF YBGYAVPP KLBGI GEGEFJjV 
LDQPJVADYNOALGTCRIAGTALCVAAGVIiLAI CLFWAM tGWliSQ 
DTTCAEPLDPEADSHVBVFGDB PEQQI*S P I FRNASGQS WFS P PAS 
PFGQSSVQTIQPKRDS 


6866 


1571 


495 


DCPRPRYTLYGLRATCMRDLDWAWINAVSAFKALEQDLPVNIKF 
1 1 EGMBEAGSVALEELVEKEKDRFFSGVDYI VISDNLW I SQRKP 
AITYGTRGNSYFWVEVKCRDQDFHSGT^ 

SLVD3SGHILVPGIYBEVVPLTEEEINTYKAIHLDI^YRNSSR 
VEKFIjFDTKEEI LMHXWRYPSI>S IHGIEGAFDKPGTKTVI PGRV 
IGKFSIRLVPHMNVSAVEKQVTRHLEDVFSKRNSSMKMVVSMTL 
GLHPWI^IDDTQYLAAKRAIRTVFGTEPDM IRDGSTI P IAKMF 
QE IVHKSWLI PIX5AVDDGEIISQNBKXNRWNYIEGTKLFAAFFL 
EMAQLH 
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ID 

NO: 
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beginning 
nucleotide 
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to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine , C=Cysteine, D=Aspartic Acid, E=* 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, k=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
Psproline, Q=Glutamine, RaArginine, 
Sr=Serine, T=Threonine, V=Valine, 
W=sTryptophan, Y=Tyrosine, X^unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
^possible nucleotide insertion) 


6867 


2833 


1704 


GTRIMSQPKQKEIAGFVRQKMLLDYSVyMGRCVPQBSRSPQRSP 

LQSAESSP1AGKKLPBVPPSEEBEQBAWVHALLGR1FWDFLGEK 

YWSDLVSKKIQMKI^OIO,PyFl^LTLTBI^MGVAVPiaiX^ 

KPYVDHQGLWIDLEMSYNGSPIjmJ2TKMNLTKLGKBPLVEALK 

VGEIGKEGCRPRAFCLADSDEKSSSAGSSEEDDAPEpSGQDKQTi 

LPGABGYVGGHRTSKIMRFVDKITKSKYFQKATETEPIKKKIBE v 

VSm'PIJ^TVBVQBCRGTIiAViaPPPPTDRVWYGFRKPPHVB^ 

ARPKI^EREVTLVHVTDHIKKKLBQEFQKVFVMPl^IJVT 

HSAMDPRSTSCLLKDPPVEAADOP 


6868 


1 


346 


RPTRPPTRPEEIKmillJYISDMNJfVQDLCKDE^ELFKTDKGFD 
KATFBSQ^VMRGQII^LTQALJRDGKSPPQLVQI PCVIVERSQG 
GS QGR IVHLSNS FTQTVNCRKPFFSS W 


£869 


3 


1619 


MYMKRMDKRALISFWESVEHLKNANKNEIPQIiVGEIYQNFFVES 

F rVSDI*YEKliI>I KKEEKHASQM I SNKDEMGPRDKAG RBAVDDGT 
HQINBQAS FAVW IOjRE KIiE YKRQALNS IQNAPKPDKKI VSK 
LKDE 1 1 LIEKERTOI^LHMARTDWWCENLGMWKAS ITSGEVTEE 
NGEQLPCYFVMVS LQEVGG VETKNWTVPKRLSE FHNIjHRKLSEC 
VPSLKKDQLPSI*SKLPPKS I DHlTMEKPSilQLNKFLC^fULSDER 
LCQSKAL.YAPLS PSPDYLKVIDVQGKKNSPSLSSFbERliPRDFF 
SHQEE ETEEDS DI>SDYGDD VDGRKDAliABPCFMI>IGEI FELRGM - 
FKWVRRTklALVQVTFGRT I NKQIRDTVS WI FSBOMLVYYINI F 
RDAPWPl^KXiAP PTTIRSKEQSQETKQRACXiKLljENI PDMLQSXi 
VGQQNARHGI IKI FNALQETRANKHIjIjYALMELLLIELCPELRV 
HJjDQLKAGQV 


6870 


1 


1566 


DL^VYGEPSRETPNl^P^IAAEGI^FPNFYSA^LCSPSRAAliLT 
GRLP IRNGPYTTNAHARNAYTPQEIVGGI PDSEQLLPELLKKAG 
YV3KIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP 
NI PVYRDWEMVGR YYBBFP I^KTOEANIjTQI YLQEALDFI KRQ 
ARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVRB1DDS 

igkjclellqdlhvadntfvpftsc^igaalisapeqggsngpflc 

GKQTT FEGGMRE PAIiAWWPGHVTAGOVSHQLGS IMDL FTTS LAL 
AGLTPPSDRAILX5IJ^PTLUJGRLMDRPI FYYRGDTLMAATLG 
QHKAHFWTWTNS WBNFRQG I DFCPGQWSG VTTHtfbEX)HTKLPL 
I PHI^RDPGKtt'PLSFASAEYOEALSRITSVVQQHQEALVPAQP 
QLNVCINWAVMNWAPPGCEKM KCLTPPBSI PKKCI.WSH 


6871 


209 


1126 


RMSLNPPI PT>KRSBENSSKFVBxTCQSQa^ 
QEVLQKAQXJSGRSKCLKCGGSI^FYCYTCYWVENVPIEQIPLV 
KLPLKIDI IKHPNETDGKSTAIHAK1XAPBEVNIYTYPCIPEYE 
ElODHEVALIFPGPQSISIKDISFKliQKRIQNNVRGKND 
FKRKRTBEQEFCDLNTOSKCKGTTLKKI I PIDSTWNQTNKIFTDE 
RLC^I^VELKTRKTCPWRHQKGKPDTFl^TIBAlYYl^VDiTO 
D I UCE KYRGQYDNLL.FFYS FMYQLI KN AKCSGDKBTGKXjTH 


6872 


880 


459 


FGUUWVLS LI FMKGNCTOEDIjI FNFI»FKI/3LDVRETNGLFGNT 
KXLITEVFVRQKYLE YRRI P YTEPAEYEFLWGPRAFLETSKML V 
LRFXAKI^KKDPQSWPFKYLEAIABCEWEDTO 
PTSRPPPR 


6873 


1929 


955 


DEQAVLCS KDKT YDLK 1ADTSNMLLFI PGCKTPDQIiKKE DSHCN 
I IHTEI FGFSNNYWELRRRRPKLKKLKKLLMENPYEGPDSQKEK 
DSNSSKXI^EDLUiQIQASEEBIMTQT^VT.NACKIGGYWRILEF 
DYF^LUraVT(^VDSESWSFGKVPl^ 

CIiKCYGKJCYVDEGEVYFELDADKI CRAAARMLLQNAVKFNLAEF 
QEVWQ<>SVPEGMVTSIiDQLKGIiAI*VDRHSRPEI IFLLKVDDLPE 
DNQBRFNSI^SLRFJCWTBEDILAPYIQDL 
S3WQNGVKVYNSRRP IS 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucl eofc ide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
CA^Alanine, C= Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=;Histidine, I^Isoleucine, K^Lysine, 
L=Leucine, M^Methionine. N=Asparagine , 
P=Proline. Q=Glut amine,. R=Arginine, 
S-Serine, T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyxosine, X«Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 


6874 


1 


307 


US IADHVNSAAVNVEEGTKNLGKAAKY 

GPIGLLAGFKVAGIAAALGGGVLGFTGGKLIQRKKQK^ 

SCPDLPSQTDKKCS 


€875 


1688 


349 


VIGTGERGNS AS E KWE I MFNBELGDPFI 1 1 HS I S LLNAEKHS IA 

TIJJ^IEKBELDMKGSGFWSIiEim'ISKKNQDNKKyEIIKRDI 

LRGKSVPHYAAIEPIXJNGLMIVSYKSLTFVG^GQDLEENMDEDI 

SBKIKKPLYYWOQTBDDLTVTIRLPEDNTKBDIQIQFLPDHINI 

VjUKDHQELEGKLYSSIDHESSTWIIKESNSLEISI^KKNEGLTW 

PELVIGDKQGELIRDSAQCAAIAERLMHLTSEELNPNPDKEKPP 

CNAQELBECDI FPEESSSLCRPDGlsrnjICT^ 

IVDP KEMPCFCLRHDVIJAIiLWQPHS SK01JDM WEHIJ\TF1^ 

OASIO©KKFFACAPNYSYAAICTCLRRVFIYRQPAPMSTVI>YNR 

KEGRQVGQVAKQQVASLBTNDPI LGFQATNBRIiFVIjTTKNLFLI 

KVWTEN 


6876 


41 


1285 


VGEmXlWRHLIjRPLCLVTSAPRILEMHPFI.SLGTSRTSVTKLS 

LHTKPRMPPCDFMPERYQVIFTjWSGSFJVNKIJV^^ 

DI IS FRGAYHGCSPYTIiGIjTNVGIYKMELPGGTGCQPTMCPDVF 

RGPWGGSHCRDSPVD^IRKC^CAPDCX^JAKIX?YIEQFKDTLSTS 

VAKS IAGFFAEPI QGVNG VVQYPKGFLKEAFE LVRARGGVCIAN 

EVQTGFGRJjG3HFWGFQTHDVLPDIV1WAX^ 

PEIAKSLAKCLQHFNTFGGNPMACAIGSAVLEVIKEENLQENSO 

EVGTYMLLKFAKLRDEFKIVCTVRGKGIWIGIRMVQDKISCRPL 

PRBEVNQIHBDCKEIMGIiLVGRGS I FSQTFRIAPSMCITKPBVDF 

AVKVFRSALTQHMERRAK 


6877 


1 


778 


GTSPSPARAYAPPTERKRFYQNVS ITQGEGGFBIHTJlHRKLKTP 
OAKLFTVPSEAIAIAVATEWDSQQOT I KYYTMHLTTLCNTSIjDN 
PTQRNKDQL I RAAVKFLDTDTI CYRVE EPETLVELQRNEWDPI I 
EWAEKRYGVE ISSSTSIMGPS IPAKTREVLVSHLAS YNTWALQG 
lEFVAAQLK^MVbTLGLIDLRLTVEKJAVIiLSRIiEEEYQIQKWGN 
IEWAHDYBI>QELRARTAAGrijFIHLCSESTTVKHKIJjKE 


6878 


531 


263 


QTIKXJDFKNRAEMIDFNIIUKNVTRSDlflGKYRCEV 

LEEDTVTLEVLVAPAVPSC^VI^SAI^TVVEI^CQDKEGNPAP 

EYTWFKIX5IRIjLENPRLGSQ^TNSSY1WNTKTGTIjQ 

TGBY8 CEARNSVGYRRCPGKRMQVDDIiNISGI IAAWWALVIS 

VCGIX?VCYAQRKGYFSKBTSFXJKSNSSSKATTMSE^ 

II 


6879 


3 

t 


845 


I RVI GESD1 MQB KLSBSDRN YNG V5 D VELR VAl^IX^TTVTVR VK 
KNSTTDOfVYQAIAAKVGM)STTVNYFALFE\rrS 
FPHKLYXQNYTSAVPGTCI.TIRKWLFTTBBEIIJ^NDIAVTYF 
FHQAVDDVKKGY I KAEEKS YQLQKLYE^PJWVMYLNMLRTCEGY 
NEIIFPIICACDSRRKGHVI TAIS 1 THFKLHACTEEGOLENQVIA 
FEWDEMQRWDTDEEGMAFCFE YARGEKKPRWVKI FTPYFHYMHE 
CFERVFCBLKWRKEBY 


6880 


2U0 


1437 


RKDNCTAKEWTFPEAKWNTTARVFSHXRLGMGHVI.I IVQCFISS 
MANI YNEKILKEGNOLTES IFIONSKLYFFGII.PNGL,TLC3T^>» Q 
NRDQI KMOTFFYGHRAF^ALIFVTAFOGLSVAFILKFIJDNMFB 
VLMAQVTTVIITWSVLVFDFRPSLEFFLBAPSVM>SIFIYNAS 
KPQVPEYAPRQERIRDLSGNIiWERSSGDGBBLERLTKPKSDESD 
KDTF 


6881 


263B 


2244 


NDSKWEDI HVITGALKMFFRBIjPE PJoFTFNHFNDFVNAIKQEPR 
QRYAAVKDLIRQLPKPNQDTMQILFRHLRRVIEKGEKNRMTYQS 
IAIVFGPTIjLKPEKETGNIAVHTVYQNQIVELILLELSSIFGR 


6882 


1 


850 


GI PEAQLWI YPVKS C^GVP VS EAECTAMGLRSGNLRDRFWIjVIN 
QEGNMVTARQBPRLVT,ISLTC3X3DTLTLSAAYTKI>IiLriPI KTPT 

TNAVHKCRVHGLE iegrdcgeataqwits flksqpyrlvhfeph 

MRPRRPHQIADLFRPKDQlAYSDTSPFLlLSEAySIiADLNSRLEK 
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1X> 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C= Cysteine, D«*Aspartic Acid. B=* 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H*»His t idine , 1=1 soleuc ine , K-Lys ine , 
Ii=I*eucine, M=Methionine, N=Asparagine, 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=*Threonine , V^Valine, 
W^Tryptophan, YaTyrosine, X»0nknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








KVKATNFRPNIVISGCDVYAEDSWDEIjLIGDVEIiKRVMACSRCI 
L'rTVDpDTOVMSRKBPIiKTLKS YR(X^DPSBRKLYGKS PLFGQYF 
VLENPGTJKVGDPVYIJjGQ 


68B3 


2794 


2256 


NS KLKLNQN LKLF I TLTYQ VLS LHGWGPG IHLQKEGAFP VTQNR 
ALQLLYDMYI^IVLTAKGDBVKSGJ^KPDSRlBKVTDHIiB^I 
DPFDLDVFTPHI^SNiaRLVQRTSVI^LVTGTENQLAPRSSTF 
ORPHNTTtPT AfifViTR FTJT.TiPIifJMT^TR KA1CSTPWT RVKTWrVT) 

W3C 


&884 


2 


99 


BFroVTAEAVKPKBXSBPRAAAQRFCKKPPPI- 


6805 


297 


1554 


STGQPWHVTOLHIiDP/rnilTDDHTKVC^ 
I^SPYQLII^SAFTJFIKNSGQEASFMIWTC^SPPIIVPVPELSTD 
TVINVITNMTTTIQSIiFP^VPPAI^NHDyWPQDQLSV^ 
YNAVANIWKP^IBEAISllJttGGFY^ ISLNTN 
LYYGPNim^NKTDPANQFEWLESTlJ^SQONKEKVyi IAHVPV 
GYtPSSC^ITAl^REYYNBKLIDlFQKYSDVIAJSQPYGHTHRDSI 
MVLS DKKGS PVWS LFVAPAVTPVKS VLEKQTNNPG IRLFQ YDPR 
DYKLLDMLQY YLNIiTEANI»KGSS I W KXEY ILTQTYDI KDLQPBS 

IMNtDNISYADOiKQLYIKHNY 


6886 


2 " 


■ 1341 


QCGGIPGREGGSSRPLKEGTGSSPACVRGAAPGSEDAFYPTRAK 

(^VSQKbKKAAK^TVSISEGPDTIX5lX^ERP^TIiALAPKPKP 

LEKEACEXWKRPFRSASATSI/TLSHCV^ 

GGAPBQRYQI I PVCVAARLPTRAQDVLDAHLSBVNAVRFGPNSS 

LIATGGADRLIHI*WNWGS RLEANQTI»SGAGGS ITS VDPDPSGY 

QVIJ\ATYKQXUVQLWKVGKAQSKBTLSGHKDKVTAAK^ 

irT/^" CTJTXD WITWtJnT rMJUVPODTTMUT O V<^TkTri\ n r/~»T3TYLI T T T Qrsuvr 
V JX?£>rCUK.± VAJ£WiUA»KAI LbKl J.N VIj£» X LNU v V (JljLlhiJL J. JLolmN 

DQKIRFWDSRGPHCTQVIPVQGRVTSLSL9HDQLHLLSCSRDNT 
I^IDI^VSNIRQVFRAIX3PKCGSDWTKAVFSPDRSYALAGSCD 
GALYIWDVDTGKI*ESRI^GPHCAAVNAVAWCYSGSHI4VSVDQGR 
KWIiWQ 


6887 


1047 


116 


WTARPSQKPFWEAGAVPGDPI.STGCSOA0LGGCCPRGPWGPQHG 

G0X3RAAGPTLPRGBRGGPQQSGPGLAAQTPPTSKQVAWRAFLTG 

TYRSQSPRSPAGPPRGGTGWWPKPAVCLCVAVGPQRLSSPGLVY 

HASGSEHCYBIYRIiYHSCADP^TGOG*lQPDftRAVroY0ACTE 

PASNimDMPPIM.PPTDELRQRYCLDTW 

DLRAASNIIFSNGtTLDPWVGGGIPJR^^ 

LRASHPEDPASWEARKLEATI IGBWVKAARREQQPALRGGPRL 
SL 


6888 


X 


992 


FVAYVKKB1 PH IVVTHCIiLNPHALVI KTLPTKLRDALFT WRVI 
HFI KGRAPKHRLFQAFFBK I GI B YSVXJ*PHTEMR WLSRGQILTH 
I FEMYKE INQFLIIHKSSNLVIX3FBNKEFKIiniAYItADLFKHLNB 
I>SASMQRTG^1NTVSARFKIlSAFVRXFPFWQKRXKKRNFTNFPFl, 
EEI IVSDNEGIFIAABITI1HI1QQL1SNFFHGYFS IGDXiNEASKH I 
LDPFIiFNIDFVDDSY1iMKNDIiABIjRASGX2II»MBFETM^ 
AQFTAFTNIAKTAI^IIMPFATT^ 
LILSDDIRVAISKKVPSFLGHH 


6889 


1 


1534 


LTLENQI KB EREQDNSBS PNGRTSPLVSO^WEQG^TLRDIJjTTT 

AGKLRVGSTDAGIAFAPVYSMGAPSSKSGRTMPNITaDDI IASW 

ENKIPPSKTSKINVRPBIiKKKPEKSIISAVDENNKLYSDrPHSW 

ICEKHILWLKDYKNSSBWKI^FKECWKQGQPAWSGVHKKMOT 

WKAESISLDFGDHQADLLNCKDSI ISNANVKKFWDGPEEVSKRQ 

KinCSGKTVVLKI,KDWPSGEDFKTMMPARYEDLr,KSLPLPEYCNP 

EGKFNIASHLPG FFVRPDI^PRI»CS AYT3VVAAKDHD1 G 

KVS DWN ILVYVG IAKGNG ILS KAG ILKKFEEEDUDD ILRKRLK 

DSSBXPGALWHIYAGKDVDKIRBFLQKISKB^LEVIiPEHDPIR 

DOS WYVNKKLRQRMiBEYGVRTWTL I QFI/3DAI VLPAGAIiHQVQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, OCysteine, D^Aspartic Acid, E= 
Glutamic Acid, P= Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K>Lysine, 
I>=Leucine, M=Methionine, IfcAsparagine, 
P=Proline, Q=Glutaraine, R-Arginine, 
S=Serine, T=Threoni_ne , V~ Valine, 
W-Tryptophan, Y^Tyrosine, X=Unknown, *-Stop 
Codon, /=possible nucleotide deletion, 
\=poesible nucleotide insertion) 








NFHSCIQVTEDFVSPEHLVBSFHbTQBLRIdjKEKINYDDKLQVK 
NII.YHAVKKMVRALKIHBDBVDDMEEN 


6890 


3 


667 


THACGMWI PLYLHRALWHKTAETCNS PP CGAKDSL I FGAITCF 
TGFIKSVITIGAGATRWCRLKMRA^ 

AAJCSSIVGAYICIPVCETI^FSNWAITADIUIYWIPTRRATAV 
AI^SF^SHLI^AGSPYLIGFISDLIl^TKDSPLWEFLSLGYA 
LMLCPFVWIX5(WFLATAr^FVSDRARAEQQWQIJUnPPASVK 
V 


6891 


I960 


1262 


LRIHQEIiLSKKLKLLRGiriBSI IHIGIiAftGKEQFMQDASNVMO; 

T«rT^V*roc?iiT."VT{rMTrr>'wiflroi7irtj/~ii\'7\ bvpt s^tnut&fvonrwYnvnrvT hot? 
jjuua. x\/£»nii i cimsunn rKV kviaaa x » jjl? V raAU jtUuLUJ YRS ItCS B 

AVPLLVKVI KKAHSKTKKWIATENCISAIGKII^KPWCVWTO 

EVLPHWI^WLPIJimKEKAIQTLSFIj^LIESNHPVVlGPNNSN 

LPKI IS I IAEGKINETINY^PCAKRLANVVRQVQTSEDLWIiKC 

VSQLDDBQQEALQBLLNFA 


6B92 


3 


876 


RSVAAASGPGAWGTDHYCLELLRKRDYEGYLCSLriLPAESRSSV 
FALRAFNVFJAQTOJSVSEKTIGLMRMQFWKKWEDIYOT 
L?FVjftJ.15LiWKAV 1LKHNJ.1 1 itKWLMKJL VDEREKNI»DDKAYRNIKEIJB 
NYAKNTQSSIlLYLTLE I LG I KDLHADHAASH I GKAQG1 VTCXiRA 
TPYIIGSRRKVFI,PMDICMI.HGVSQEDFmRNQDKNVRDVIYD.TA 

RCi AHT»Tir JfHAtt <5 PHTC'PU l>VTTAPD a vj -/YTV cj T.l?rrc?r MVTf\o\i iiiJi \ 
^miunuMuiiwriiivivi v nnrrAr voJ-ix*L#rijjViViyK.v jjrij 

IFHPSLQQKNTLLPLYIiYIQSWRKTY 




1 


842 


DGERKSM3VERTFSEIWKAEEQYSLCQEEiCSEIiAQDIiQKKRI»KG 
RTVT I KIiIOiVNFEVKTRASTVSSVVS TABE I FAI AKELLTCTEID 
ADFPHPLRLRLMGVR1SSFPNEEDRKHCX3RSI IGFLQAGNQALS 
ATECTLEKTDKDKFVKPIiEMSHKKS FFDKKRSERKWSHQDTFKC 
FAWKQSFQTSQPFQVLKKKMNENLEISENSDDCQIIiTCPVCPR 
AQGClSI*EALNIOIVDECXIX3PSISBNFKMFSCSHVSATKWKKB 
NVPAS SLCBKQD YEAH 


6894 


1742 


1463 


inOiCKPLVPREHQFYETIiPAEMRKPTPQYKGKSQIJjEGLPHWRG 
DVRDRGHGRPWQPSLBPSLPPTLCFPSLSSFSSSWPSAQHLTPS 
VFNPW 


6895 


2379 


478 


VTYVBLCDIASPTAI^IMimnjDLIVRDLQSTSEDKEGXJYTSOT 
TRI»IALIi YALASHKACKliAl LHL I NGT I KGDERYAE I FQDLIiAL 
VI^PGDSVIRQQCVBYVTSII^LCMDIALII^SSSBGSISKL 

AEHDYGLFKJ>KSSLRKNSSAIiHSriKRVVSTFSKDTGEIiASSF^ 

E FMRQ II^SDT IGCCGDDNGLMEVEGAHTSRTMS IWAAELKQLL 

QSKBESPB1^FIjBIiEKLVLBHSKDDDNIJ>SLLDSVVGLKQMLES 

SGDPI*PI^IWVEPVLSAPESLQNl^FNirc 

MWFTPFQAEEIDTDLDLVKTOLIEI^EKCCSDFDLHSELERS fl 

SEPSSPGMTCTTKGFKLGKHKHETPITSSGKSEYIEPAKRAHVV 

PPPRGRGRGGFG<X3IRPIfflIFRQRKX2NTSRPPSMHVDDFVAAE5 

KEVVPQDGIPPPKRPLKVSQKISSRGGFSGNRGGRGAFHSQNRF 

FTPPASKGNYSRRF^TRGSSWSAQNTPRGNYNESRGGQSNFNRG 

PLPPLRPLSSTGYRPSPRDRASRGRGGLGPSWASANSGSGGSRG 


6896 


1 


555 


G^IVIQKKKYNKQHI IPLENVTIDSIKDEGDLRNGWLIKTPTKS 
FAVnfAATATBKSEWMJraXNKC^m>LI^KSGKTPSrraiAAVWVPD 
SEATVCMRCQKAKPTPVNRRHHCRKCGFVVCGPCSEKRFIJjPSQ 
SSKPVRIOTFCYDLI^AGDMATCQPARSBSYSQSLKSPLNDMSD 
DDDDDDSSD 


6897 


3 


920 


GDGI^HKVWGIJ^ERPDWErAlQKPLCSLPAGSGNAIJU^LNHY 
AGyEQVTNEDW/niCTUJXI^^ 

SLAWGFIADVDLBSEKYRRr/3F^FTl^TFliPXAALRTYRGRtA 
YLPVGRVGSKTPASPVVVQC^PVDAHLVPLEEPVPSimTVVPDE 
DFV^ VIiALLHSHZjGSEMFAAPMGRCAAGVMHLPYVR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AaAlanine, C=Cysteine, D^Aspartic Acid, B=* 
Glutamic Acid, P=Phenyl alanine , G=Glycine, 
H^Histidine, X=Isoleucine< K= Lysine , 
L=Leucine, M— Methionine , N=Asparacjine, 
P=Proline» Q=Glut amine, RaArginine, 
S=Serine, T=Threonine f V=Valine, 
W=»Tryptophan, Y=Tyrosine , On known, *=»Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








LMVSEAVQGQVHPNYPWMVSGCVEPPPSWKPOQMPPPEEPli 


6898 


919 


346 


QKTVTAVAS LLKGRQG I YTENERRMGAV I KIRFFKXMLVtil ICW 
LSNI INBSLLPYLBMQTOINGX5SLKPVRTAAKTTWFIMGILNPA 

MPHENPASGKVSQVGGOTSDEAI^MJ^KGSDASTIBIHTASBSC 

XVJIX* JHaU rViXAc X HI71/LI 


6899 


120 


827 


MKVRKJWDAYLLDKNKlNMlXTISCPFKKr^TTI^ 

Ui&fSVlSJ&X XIToJLttr^^AXAKOXLll. V rn VE>JXAj[VVo VHwiK.X\jIoMr>l 

TFHTKPFYGGKLHRVTABVKHNITNTWCRVQGBWNSVL 

EIDKATEHKHTTiEERQRTEERHRTETGTPWKTKYFIKEGDGWY 
HKPLWKIIPTTQPAB 


6900 


r 3 


451 


TEVLGSKGIHELi^STSALHHALEESASLLTMPWRAALPSTHI P 
VLPGKVGE3TERELIiEI»RTKYSCX3EQ 

MEQF I VSQItTRTHDVLKKARTNLKVR KUjHQSEAPST^S PTHHHP 
IADLVGDSWPALRFQEK 


6901 


1 


201 


DDNMVQRLKTDFKMTLQ<&ST^^ 
SFPKAAHQFLLKWePYRYHLGFS 


6902 


2 


267 


GAPPPPPSQPPRQPPQAAPSSHpHSDLTFNPSSAIiEGQAGAQGA 
SDMPEPSLDLLPKLTNPDKlabSYUDPPDLPSNSNDDLLSLFKNN 


6903 


1 


149 


RINQVYRQGPTGI HI LVIDQMVQNFQDESCFtFSTVKAE^SlIX^ I 
HULK 


6904 


464 


2092 


MEASLpVSI^CVIiACCDVEGKFDILFNRVQAIQKKBGNlfUljLJj(J 
VGNFFGSTQDAEWEBYICrG I KKAPIQTYVU5ANK0ETVKYFQDA 
dgceiiaenityi^rkgiftgssgi»qivyl»s^ 
SPKDVSSLRMMLCTTSQFKGVDIIJjTSPWPKCVGNFGNSSGKVD 
rKKCGSALVSSIiATGLKPRYHFAAIiEKTxTERLP YRNHI ILQEN 
AQHATRF I ALANVGN PEKXKYLYAFS I VPMKJMDAAELVKQPPD 
VTElfP YRKBGQEAS IGKQIl*APVEESACQFFFDtlJEKiCX3RKRSS 

lYiPriK VC QDHPTfrtPPITPPnPPfiPCSWf , T JLS PHVE KHT.WNTCTFI 

CYIAUOOSGI^DDHVLItiPIGHYQSWELSAB^ 

RRFFKSRGKWCVVFERNYKSHHLC^VIPVPISCSTTDDIKDAF " 

ITQAQEQQIElJ^IPEHSDIKQlftQPGAAYFYVEI^^ 

IKKtf I' PLQFGRE VLAS EAI LNVPPKSDWRi>CQI S KBDEETJL«ARR 

FRKOFEPYDFTLDD 


6905 


1 


226 


VSFCTGKAETITSHYLFALGVYRrbYLFNWIWRYHFEGFFDLIAI 
VAGLVQTVLYO)FFYLYITKVLKGKKLSLPA 


6906 


3 


611 


S YDDHNGHIDFI TAASNLRAKM YS IEPADRFKTKRIAGKI I PAI 
ATTTATVSGXjVALEMIKVTGGYPFBAYXNWFIJ«LAI P IWFTET 
TEVFJKTKIRNGISFTIWDRWTVHGKBDF^ 
PTWVVQGVKMLY^VWFGHAKRLKM 
SFAPDlDGDEDLPGPPVRYYFSHDTD 


6907 


2 


2228 


LRGVPVWAAGAFRFS SGEESTS HLI MS RR£QRI>TRYS QGDDDGS 

SSSGGSSVAGSQSTIbFKDSPLRTLKRKSSNMKRLSPAPQLGPSS 

DAHTSYYSESLVHESWFPPRSSLEELHGDANWGEDIiRVRRRRGT 

GGSESSRASGLVGHKATEDFIXJSSSGYSSBDDYVGYSDVDQQS5 

SSRLRI^VSRAGSLLWKVRTSIKJRIjF 

SLLDVFVLTRRFSSLKTFJ^FIxLPIJiIJiTCLTY^ 

TFRPALVSWWAAKDSRKADEGWEARDSSPHFQAEQRVMSRVHSL 

EPJU^AIAAEFSS^QKSAMRLERLELRQGAPGQGGGGGLSHED 

TIaALLEGLVS RRBAALKEDFRRE TAARIQBELSALRAEHQQDS E 

DI»FKKIVRASQESEARIQCJQKSEWQS^^^BSFQESSVKEIlRRI>E 

DQLAGI^BIAALALKQSSV^ 

WISQFIiARGGGGRfTOIJWRKKMQAQLRELESKILTHVAEMQGKS 
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SEQ 
ID 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepta.de 
(A=Alanine, C=Cysteine , D^Aspartic Acid, K~ 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H-nistidine, I=Isoleucine, K=Lysine, 

Leucine, M=Methionine, N=sAsparagine, 
P=Proline, Q~Glut amine, R^Arginine, 
S=Ser-ine, T^Threonine , V=Valine, 
W=*Tryp t ophan , Y=Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ARKAAASIiSLTLQKEGVIGVTKEQVHHIVKQAIiQRYSEDRIGLA 
DYALESGGASVISTRCSETYETKTALLSLPG1PLWYHSQSPRVI 
I^PDVHPGNCWAFQGPQGFAVVRLSARIRPTAVTLJBHVPKALSP 
NSTISSAPKDFAI PGFDEDliQQEGTLWSKFTYDQDGBP IQTPHP 
QAPTMATYQWELRILTNWGHPEYTCIYRPRVHGBPAH 


6908 


3 


780 


QVPSAAWLMAVCGLGSRLGI^SRIXJLQGCFGAARI^YPRPQSRG 
P<X5VEDGDRPQPSSKTPRIPKIYTKTGDKGFSSTFTGKRRPKDD 
QVFPJWGTTDEL SSAIG FALELVTEKGHTFAE ELQKIQCTLQDV 
GSALATPC^SAREAlILKrTTFKAGPILBIiEQWlDKTrSQLPPLT 
APILPSGGKISSAI^PCRAVCRPAKRRWPLVQMGKTDANVAKF 
LNPJ^DYLFTIJVRYAAMKEGNQEKIYKKNDPSAESEGL 




3 


409 


GRLLAVGTDLYGQRS SAPEQE LLVQ0ATP VSNS L LPE KAFSDI P 
SPYLRGTIKMMQAVRQAFQDQDDRRTWTC 

LCVVDTIKRSSQTGEWO^IAIMTEEPFJjSPAYLISRAMRRSRMS 
LYC 


6910 


1 


10(58 


lvpwvxdsyyygklviaplnivlyniftphgpdlygtepwyfy 
lingfljtpnvafaiju^viiplaslmeyllqrpwqnlghpywlt 

LAPMYIWFI IFFIQPHKEERFLFPVYPLICLCXJAVAIjSALQHSF 

LYF^KCYHFVFQRYRIiEHYWraNWLAIiBTVPLF^ 

LFRGYHGPIiDLYPEFYRIATDPTIHTVPEGRPVNVXnrt5I0Sf?YRF 

PSSFliLPDNWOLQFIPSEFRGqiiPKPFAEGPLATRIVPTDMMDQ 

NLEK PSRYI DI S KCHY LVDLDTMRKTPRKPKYSSNKEBW I SLAY 

RPFLDASRS S KXLRAFYVP FLSDQYTVYVNYT I LKPRKAKQIRK 

KSGG 


6911 


1184 


966 


G E DAEEMETGNVANL I S I FGSSFSGLLRK5PGGGREEEEGEESG 
PBAABPGQICCDKPVLRDMNPWSTAIVAF 


6912 


1 


844 


amkp vethsfqmlfti lstgs alkaqs yedayrci kss illgsi 
sggtdiiscfmghnfsi*pvykgeigarni/3maveawnebgkavw 
gesgklvctkpipcxjpthfwndengnkyrkayfskfpgiwahgd 
yc31i1^ktggivmlgrsix5ti^pngvrfgsseiyni^f>sfee^ 

DSLCVPQ YNKYREERVT LFTjKMASGHAFQPDLVKR I RDAI RMGL 
SARHVPSLILETKGIPYTLNGKKVEVAVKQI IAGKAVBQGGAFS 
NPBTLDIiYRD I PBLQGF 


6913 


1643 


. 1558 


KKSHEESHKBELSYGAQASLPLPCSDFR 


6914 


1251 


615 


ELAAECJCSAGYPGTLIPYR(^I^NEE1)II^FSAIRS^HSGVDI 
CINl^rJ^Pimj^STSGmDMFNVNVIJVLS I CTREAYQSMK 
ERNVDDGHI INI NSMSGHRVLPLSVTIIFYSATKYAVTALTEGliR 
QBI^KAG^mPJ^TCISPGWETQFAFTa^HDKDPEKAAATYE^K 
CLKPBBVABAVIYVI^TPAHIQIGDIQMRPTBQfVT 


6915 


254 


652 


GRSLSFKTFLIWVLISIYQGGIIiMYGALVIiFESEFVHVVAISFT 
ALILTEIiLMVALTVRTWHNLMVra 

VAFITTVTFXWWSAITWSCm.YVLKYLRilKI^PP^YCia^ 


6916 


254 


652 


GRSLSFKTFLIWVLISIYQGGII^GALVLF^SEFVHVVAISFT 

ALILTELI^ALTVKIWHWLMWABF 

VAFTTTVTFLWKVSAJTVVSCLPLYVLK^^ 


6917 


254 


652 


GRSLSFKTFLIWVTjISIYO^IIJWGALVLFBSEFVHVVAISFT 
ALILTE LIjMVALTVRTWHWLMVVAE FLSLGCYVS S LAFLNE YFD 
VAFITTVTFLWKVSAITWSCI^LYVLICYLRRKLS PPSYCKLAS 


6918 


28 


921 


PEAGTRSWREPDPEDLRRFIJjSAACRSFPQWLPGGGGGQVSSCS 
DTDVP YLLLAVKS E PGRFAERQAVRETWGS PAPGI R LLFLLGS P 
VGBAGPDLDSLVAHESRRYSDLLI^DFLDVPFNQTLKDLIJ»LAW 
LGRHCPTVSFVLRAG#DAFVHTPAI^^ 

VFTQAMPLRKPGGPFYVPES FPEGGYPAYASGGGY VIAGRLAPW 
IiLRAAARVAPFPFEITNOTGLCIRALGLVPQAHPGFLTAWPADRT 
ADHCAFRNLLLVRPLGPQAS IRLWKQLQDPRLQC 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

CtnULUU C*l — LU 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alonine, C=Cysteine, D-Aspartic Acid, K- 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H»Histidine, I=Isoleucine, K=*I#ysine, 
li=tieucine, M^Methionine, N=Asparagine, 

S— Serine, T— Threonine, V= Valine, 
W=Tryptophan, Y»Tyrosine, X "Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=posaible nucleotide insertion) 


6919 


850 


41 


QGRRELSGSVFCPFIQQEPKEMLTIiSEYHBRVRSQGQQLQQLQA. 
EI^IOiHKEVSWRAANSERVAKLVPQRLNfiDB'VRKPDYALSSVG 
AS IDI^KTSHDyADRNTAYFWNRFSFWNYARPPTVILBPHVPPG 
NCWAPEGIX^VVIQLPGRVQI^DITI>QHPPPSVEHTGGANSAP 
RDFAVFFIJ^PFTHQGIiQVYDETEVSr^KFTFDVEKSEIQTFHL 
QNDPPARPPKVKIQILSNWG11PRFTCX.YRVRAHGVRTSEGABGS 
AQGPH 


6920 


1418 


591 


EAQGPSKVHLTLKKKK 


6921 


2 


1711 


MNATP^EEQFHVINHABQTLPJCMEimJCEKQLCDVLLIAGHLRI 

PAHRLVLSAVSDYFAAMFTNDVLEAKQEBV^ 

Q YAYTGVLQLKEDTIESIXAAACLIjQIiTQV I DVCSNFL I KQLHP 

SNCXGIRSPGDAQGCTELIiNVAHKxTMEHFIEVIKNQEFI*LL 

NEISKLLCSDDINVPDEETI FHALMQWVGHDVONRQGELGMLLS 

Y IRLPLLPPQLLADIiETS SM FTGDLECQKLLMKAMKYHLLPERR 

SMMQSpRTKPRKSTVGAIiYAVGGMDAMKGTITIEKYDLRTNSWL 

H IGTMNGRRLQFGVAV I V*i KJjYWGGRDGLKTLNTVK CFNP VGK 

I WTVM PPMSTHRHGLGVATI*EG PMYAVGGHDGWS YLNTVERWDP 

EGRQWNYVASMSTPRS WGVVALNNKLYAJGGRDGSS CLKSMEY 

PDPHTNKWSIiCAPMSKRRGXJVGVATYNGFT^^ 

SPO»SDC^BRYDPKGD3H3TVAPI^VPRDAVAVCPI^KLYVVGG 

YIX?HTYIJyiVBSYDAQRNEW 


6922 


1075 


369 


IiTPPAGIRHBVRDRBRERERERKRBKFPIjDSTGSEIJOQNIHSIT 
G JjP P AMQKVM YKGLAPEDKTLREIKVTSGAKI MGGGSTINDVLA 
VNTPKDAAQQDAKAEFJJKKEPIX^Q 

KGAQEPJjPTV^LSGMCTKSGGKVRIiTFKLEQDQLWIGTKERTBK 
LPMGSIKKWSEPIEGHEDYHMMAFQIjGPTBASYYWVYWPTOY 
VDAI KDTVLG KWQ YF 


6923 , 


2469 


1660 


LGLFCILPIDTIiCAVLERDTLS IRESRLFGAWKWAKAECQRQQ 
LPVTFGNKQKVIGKALSblRFPLMTI EKFAAGPAQSGILSDREV 
VinjFIiHFTVNPKPRVKYIDRPRCXrUiGKECCIN^ 

pf^n cnnTDiyp^rMOPTo Tiw*py*T •w* o th/"* tvt rw/-vi tut r\ t tpvlw 
SGTSDRIRFU VJMHRISIVGrialj YGS IHfelrTIJlQVNXUilJSl bKK 

QTIiGQNDTGFS CDGTANTFRVMFKE P IE I LPN VC YTACATLKGP 

DSHYGTKGLKKVVHF/1TAASKTVFFFFSSPGNNNGTSIBDGQIP 

BIIPYT 


6924 


2210 


1235 


PEERVICFVKYYLXAFHEGRKGAUAKKPYNPI IGETFHCSWEVP 
KDRVKPIOn'ASRSPASCHEHPMADDPSl^YKLRFVAKQVSHHPP 
ISCFYCECTBlOlU^NTHVWTra 

HGEEYVFTLPSAYARS ILTI PWVELGGKVS INCAKTGY S ATV I F 
HTKPFYGGKVHRVTAEVKHKPTMIVCKAH^ 
ETKVIIXrTTI^WPKKIRPLEKQGPMESRNLWREVTRYIiRLGDI 
DAATEQIOiHLBEKQRVEERKREKLRTPWKPKYF IQEGDGSG I LQ 
SPXJBSTLMGIiEVQSFPV 


6925 


2 

• 


1653 


RGGAAGAAMBPDSVIEDKTIELMCSVPR3LWLGCANLVESMCAL 
^CT^n^MP^URfTiO T ciNGTS S VT VSRKRPSEGNYOKEKDLCIKYF 
DG^SESDQVEFVEHLXSRMCHyQHGH INSYLKPMLQRDPITALP 
EQGLDHI AENIL S YLEIARS LCAAELVCKEWQRVI S EGMLWKKLI 
EPJ4VRTDPLWKGLSERRGWDQYLFKNRPTDGPPNS FYRSLYPKI 
IQD IET IESNWRCGRHNLQR IQCRS ENS KGVYCLQ YDDEKI I SG 
LPJDKSIKIWDKTSl^CIiKVI.TGHTGSVlXl^ 
TVRVWDVNTGKVLNTLIIfflNE^ IAV 
WDMASATDITLPJIVLVGHPJUVVNVVDFDDKYIVSASGDRTI KVW 
STSTCEFVRTLNGHKRGIACIiQYRDRLVVfiGSSDNTlRLWDIBC 
GAO^VI^GHBBLWCIRFDNKRIVSG^ 

RAPASTLCl^TLVBHSGRVFRIiQFDE FOI IS S SHDDTI L IWDFIj 
NVPPSAQNETRS PSRTYTY I SR 


6926 


1 


733 


SGRVAMIXSLGLQFPEWFTAGPPIiPPHMGGHYRDW 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spondlng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acxd segment containing signal peptide 
(■^Alanine, C~Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, ^Phenylalanine , G=Glycine, 
H=Histidine, l=l3oleucine, K=Lysine, 
L=l»eucine, M=Methionine, N-Asparaginc , 
P=Proline, Q^lutamine , R=^Arginine, 
S=Serine, T=Threonine, V~Valine, 
W=Tryptophan, Y=Tyrosine, X -Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\*poasible nucleotide insertion) 








DGYPLPTPDTSPbDGVDPDPAFFAAPMPGDCPAAGTYSYAQVSD 
YAGPPEPPAGPMHPKLGPEPAGPSIPGLLAPPSALHVYYGAMGS 
PGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPTPPPRALPCRDGT 
DPSQPAELLGEVDRTEFEQYLHWCKPEMGLPYQGHDSGVNIiPD 
SHGAt SSWSDASSAVY YCNYPDV 


6927 


2 


1484 


LTLCX5DIQL^u^AQNANNRAAHLEBFHYQTKEDQEILHSI 1 HRESS 

CQG FAWATDLS TDLBSQIfSVSOTCYEAAtreiliQFRDIiKSQWPSH 

YVQVXjKRMGNIRNKl^FTmQAAAIiQSERWSKSVSAAEC^ 

KKSFSCFEKGIHNFESI BDATNAALIiI/tNTORLMRI CAQAHCGA 

GDKLKRBFSP E RGLY YNKAI DYYLKALRS LGTRDIHPAVWDSVN 

WBLSTTYFTMATLQQDYAPI>SRKAQEQIEKBVSBAlWKSLKYro 

VDSVSARQPLCQYPJ^TIHHIUASMYHSCIiRNQVGDEHLRKQHR 

VI^UIYSKAAKI^LLKDAPCBI^VQIiERVAFAEFQMTSQbK 

NVGKLKIMGAI^IMVRTBHAPQIjIQKKLIEEFGQPKSGDAAAA 

ADASPSLNRKEVMKIjIiSIFBSRLS 

IKDDTIUCTNKHIYSQLLRATANKTATI^ERINVIVHLIiGQIJA 
GSAASSNAVQ 


6928 


1086 


777 


EAIDLrNNIiLQVXMRKRYSVDKTI^HPt^DYQTWLDLRELECK 
IGERYITHBSDDIiRWEKTAGEXX5IjQYPTHLINPSA3H3DTPBTB 
ETEMKALGBRVSIt, 


~~ 6*929 


1749 


607 


RDQRGYRDDRSPARBPGDVSARTRSGGGGGRSATTAMPPPVPNG 
NLHQHJDPQDIiRHNGNVVVAGRPSCSRGPRRAIQKPQPAGGRRSG 
RGPAAGGLCLQPPDGGTCVPEBPPVPPMDWEALEKHLAGI^F^ 
QEVRNQGQARTNSTST^KNERES IRQKLALGS FFDDGPGIYTSC 
SKSGKPSLSSRLQSGMNLQI CFVNDSGSDKDSDADDSKTETSLD 
TPI^PMSKQSSSYSDRDTTEBE3ESIJ>DMDF1jTRQKKIjQAEAKM 
ALAM7UCPMAKMQ VEVEKQNRKKS P VAD LLPHM PH I S ECLM KRSL 
KPTDLRDMTIGQI^QVIVNDLHSQlKSLNEEXiVQLIiLIRDELHTK 
QDAMLVDIEDLTRHAESQQKHMABKMPAK 


6930 


131 


545 


FKDTANVFVS L FQMRNNFRH Y PI B PSQLKLFYDVI TWI VTQ VAI 
SYTWPPVLLS IKPSLTFYSSMYYCLHXLGII/VLLLLPVKKTQR 
RKbmOWIQI^OSKKFDBGBNSIjGQNSFSTTh^CNQNQBIASR 
HSSLKQ 


" 6931 


2 


659 


FVWRUPNRPACLLVASGAABGV3AQS FLHCFTMAS TAFNLQVAT 
PGGKAMB FVDVTE SNARWVQDFRLKAYAS PAKXES IDGARYHAJb 
LIPSCPGALTDLASSGSLAR I I»QHFHS ES KP I CAVGHGVAALCC 
ATNBDRSWVFDSYSLTGPSVCEIiVRAPGFARXPLVVEDFVi^ 
ACFSASRPDA VHVVLDRHLVTGQKAS S TVPAVQNI>L FLCGSRK 


6932 


2 


1131 


FVDSPGQGEQAEEEEGGIQMNSRMRAHSPAEGASVESSSPGPKK 

SM^CEGC^IW^GHPGYlSIfflKKrSIKYVSHQHPSHPQLreXVR 

C^CVRSI^C^CMREGPIFFGDEQEIGFVFSHTFFIKDSriARGF 

QRWYSII TIMMDR I YLINS WPFLLGKVRG I IDELQGKALKVFKA 

KQFGGPQRAQRlWTAFTPFIiHQRNGNAARSLTSL^ 

HTSFAWLLKACGSRLTEKLLEGAPTEDTLVQMEKL^LEBESES 

WDNSBAEREEKAPVLPESTBGRKLTQGPABSSSLSGCGSWQPRK 

KTLQEVTDSLLGG WLHAQGVGGI I 


6933 


1431 


890 


SLNIiHCTLPPPPHQYPAGYPSDKEGKKPKGO^KKQPSGTTKRPI 
SDDDC^SASKVYKASDSAKAIBAFQLTPQQQHIilRBDCQNQKLW 
DEVLSHLVEG PNFLKKIiEQS FM CVCCQELVYQP VTTE CPHNVCX 
DCLQRS FKAQVFSCPACRHDI/SQN YIM I PNEIIiQTLLDIiFFPGY 
SKGR 


6934 


3030 


2588 


DRDHSQCX^IRRVAIJU^VSSVKLISKAKIRTVKMTPIIVIiAFIV 
CWTPFFFVQHWSVWDANAPKBASAFI IVMLLASLNSCCNPWIYM 
LFTX3HLFHELVQRFLCCSASYLKGRRIiGETSASKKSNSSSFVI>S 
HRSSSQRSCSQPSTA 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cyeteine, D^Aspartic Acid, E= 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=Histidine, I~Isoleucine, K-Lysine , 
L=» Leucine , M-Methionine, M=Asparagine, 
P=Proline f Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W-Tryptophan, Y^Tyro3ine, XUnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


€935 


886 


543 


WSALYVAGGNDGTSCLNSVBRYSPKAGAWESVAPMNIRRSTHDl* 
VAMDGWLYAVTSGNDGSSSLNSIBKYl^RT^^ 
VGVAVLELLNFPPPSSPTLSVSSTSL ! 


6936 


1347 


567 


RSHRRQFLSRMiEFFGKSHPPPHRLFRKSLNVGLHYSHIPFLT 
TCIiHFl^RKRLQRGBVGLSVBTSKPQ^ 

MEKRLQEAQLYKEBGNQRYREGKYRDAVSRYHRAI>I^IJ?GLDPS 
LPSPLPNIXSPQGPALTPEQENILHTTQTDCYNNIA^ 
HYERVREYSQKVLERQPDNAKALYRAGYAFFfflXJDYDQARHYLL 
AAVNRQPKDANVRRYIjQIjTQSBLSSYHRKEKQLYLGMFG 


6937 


1 1 


727 


AVEFRCCPGRDPACFARGWRLDRVYGTCPCDQACRFTGDCCPDY 
DRA.CPARPCFVGEWSPWSGCADQCKnmTlVRRRSVQQEPQNGGA 
P CPPI.EERAGCLEYSTPQGQDCGHTYVPAP I TTSAFNKERTRQA 
TS PHWSTHT3DAGYCMBFKTBSLTPHCALBNRPLTRWMQYLREG 
YTVCVDC^PPAlWSVSIiRCSGDGIiDSIXmQTLHJ^ 
TWKKVRRVDQCSCPAVHSFIFI 


6938 


3 


719 


NSRXLELAERVDTDFMQLKKRRQSSEKBNDSGTLDTVGAVVVDH 
KGN VAAAVS SGGLALKH PGEVGQAALYG CGCWAENTGAHNPYS T 
AVSTSGOGEHLVRTI LARECSKAIXJAEBAHQALLETMQNKPI S S 
PFUVSEIXJVI^GVIVIxRSC^CSABPDSSQHKO/n^VEFI^SHTT 
ESMCWYMSAQOGKAKTHISRLPPGAVAGQSVAIEGGVCRLGEP 
SELTLQAECEASQRHFRT 


6939 


3 


810 


KVTAPRRPQRYSSGHGSDNSS VLSGELP PAMGRTALFHHSGGS S 
GYESIiRRDSEATOSASSAPDSMSESGAASPGARTRSLKSPKKRA 
TCLQRKRLIPAPLPDTTALGRXPS 

ElKWfBIDDVERI^RPRPTPREAPTQGIACVSTRIiRIABRRQQR 
LREVQAKHKHLCEELAETQGRLMLE PGRWLEQFEVDPELEPES A 
EYLAAXiEPJlTAALEQCVNXtCKAHVMMVTCFDXSVAA^ PGPQ 
EVDV 


6940 


1188 


496 


GKMAAQPIuRmSRCATPPRGCFCGGTERAlDQAS 

VVKGSSPIiGPAGIjGAEEPAAGPOLPSWLQPERCAVFQCAQamV 

IADSVHIAHDLSRSLGAVVESRVTNNVVLKAPFt.VGIEGSI.KGS 

TYNLLFCGSCGIPVGFHLYSTHAAIJVALRGHF 

K^^CAIVNASEMDIQ^^VPLSEIaABL:<EiC^VLTHNRLKSTMKILS 

BVTPDQSKPEN 


6941 


1 


713 


SLSRADSDPHGPHTCGHVLNVI IGSNVLALAEAQRQAEALGYQA 
VVLSAAMQGDVKSMAQFYGLLAHVARTRLTPSMAGASVEE^ 
HELAAELQ I PDI^LBEALKTMAM^GPVCT JAGGEPTVQLQGSG 
RGGRNQEIALRVGAELRRWPLGPIDVLFLSGGTtt 
AWVTPELASQAAAEGLDI ATFLAHNDSHTFFCCLQGGAHLLHTG 
MTGTNVMDTHLLFLR PR 


£942 


1 


246 


GD WERVDP KTDTWTW3APLSMPTNAVGGCLLGDRLYADGGYDG 
QTYLNTMBS YDPQTOEWT(91A3I^IGRAGAC\n(n^I KQP 


6943 


1 


739 


PmTGDGAKTLAIHVKALTADS IR ITWKATI^ASS FRLSWLRL^ 
HSPAGGSITETLVQGBKTEYLLTALEPKPTYI ICMVTMETTMAY 
VADKTPVCAHABTADS YGPTTTLNQBQNAGPMASLPLAGI IGGA 
VALVFLFLVLGAI CWYVHQAGELLTRERAYNRGSRKKDDYMESG 
TKKDNSlLEIRGPGl^MLPINPYRAlOiEYVVHTIFPSKGSSLCK 
ATHTIGYGTTRGYRDGGI PDIDYSYT 


6944 


960 ' 


156 


VANI LH*GVKYB S 5LTGSS ERAEQ PLSVGRLCS TI CNM PKALKT 
LCVNHFLGWLSFBGMI*LFYTDFMGBVVFQGDPKAPOT 
NSGVTMGCWGM CI YAFSAAFYSft XLKKIiEEFLS VRTLYFIAYLA 
FGI^TGIATI»SRNLYVVI>SLCITYGILFSTLCTLPYSLLCDYYQ 
SKKFAGSSADGTRRGMGVDI SLLS CQYFLAQ ILVS LVLGPLTSA 
VGSANGVhrYFSSLVSFLGCLYSSLFVIYEIPPSDAADEBllRPLL 
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WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AsAlanine, C=Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, F=Phenyl alanine, G=Glycine. 
H=Histidine, I-Isoleucine, K=Lysine, 
I>=I>eucine , M-Me thionine , N*= Asparagine , 
P= Proline, Q=Glutamine, R^Arginlne, 
S=Serine, T=Threonine, v^Valine, 
W^Tryptophan, Y^Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 


6945 


2067 


179 


EGEDKGLPRTMGAMjG TGTRIiAP W PGRACGALPRWT PTAPAQGC 

HSICPGPARPVPLKKHGYDVTRNPHUJKGMAFTLBER^ 

I PPCFLSQDVQIiLRlMRYYKFjQQSDIiDm ILMTLQDRNEKLFY 

KV Jjl oUVCiIuT Pllr J. V 1 1 Jr I VuUniJUfSXwuXJriUMrAAatljf * xnu 

hiatmi^swpednikavvvtdgerilglgdijck:ygmgipvgkia 
lytacxx^/npqqclp vlldvgtnne bllrdplyiglkhqrvhgk 

AYDDLLDBPM(^VTDKFGINC1»IQFEDFAN^^ 

CMFNDDIQGTASVAVAGILAAtiRlTKNKI»3imVTO 

GXIAHI^VMALBKKBGVPKAXEATRKIWXMVDFXKBIiIVQGRIM 

LNHEKra^QD\HPEWSI^BVVRLVKPTAIlGVAAIARA\FTK 

QILRDMASFHKRP\IIFAI^NPTSKABCTA\EKCYRVTBGPRGP 

FAS \ GSPF *GVL I WEMGKTPI PGGRGNNA*R VPRGWQLGVHS PG 

GDPGHIP\DBIFLPDSRAKtiPQEVSBQHLSQGRLYP\PLST\lR 

NVFl^IAIKVFI)*GyKHNLV\SYYPEPKD\KEAPCKIPG8yTPD 

YDS fyt/vds yi waqgkamnvqtv 


6946 


133 


2551 ' 


SCEYSGITVAPGDPCPGVAHI>IAPSMASDTP 
NIiDGTLGYLJjDKETIjRIiHPDI FTjPS EI \CDRLVNEYVBLVHAAC 
NF\BPHB\SFPNPriFRDPRKQPASRRIHL\RBD\LVQD\QD\tB 
AIRKQDL\VEL\YLTN\CKIO^SAKSI J QTU«FSinTiGVP*AF^ 
C\TNIIJXPJCENPGGL/CEDF^FNPTCQ^^ 
P\rJOLGRMlDWVPVES\IJ^PLNSlAAIJ)I£GIQTSDAA\ FKTQ 
WKDSIiXVSLVLNYNMDLSDDIIIRN^VQLHKLRHI^ISRDRLSS 
YYKFKLTkBVLSLF\^KI*HIU4BI©IS 

EAGQTSI\EPSKVSSIIPFRGFBGGPI>QF\M5VF*GIFCGRLTH 
IPAYKVSGDKNBEQVIjNAI BAYTBHRPBITSRAINIJjFDIARIE 
RCWQIXRA1jKLVITALKCHKYDRNIQVTGSAALFYI,TNSEYRSE 
QS VXLRRQVI QVVLNGMESYQBVTVQRNCCXTLCNFS I PEELKF 

QYRRVNEIjLLS ilnptrqdes iqriavhlcnalvcqvdndhkea. 

VGKMGFVVmLKLTQKKIil^KTCI>QV2^EFSW\SALWNI TDETPD 
NCEMFL&FNGM10>FIJ3a2IBFPBKQBLHRN^ 
RPQI»MTSQFISVFSNIJiBSKADGIBVSYHftCJGVLSHIMF^ 
KGVCEPQREEVBERMWAAIQSWDINSRRNIim^SFEPILRliLPQ 
GISPVSQHWATHALYNLVSVYPDKYCPIiLIKEGGMPIiLRDI IKM 
ATARQETKEMAPJCVI EHCSNFKEENMDTSR 


6947 


2 


#82 


TSVSTI PRGIASARPQSRS WRCX^VTORSPGRARGRGIJtMliNVP 
one pn x DD onnP VAfiTSfJR <5 KVPI . KTYTRfi UHDH JH1/PKSGKD1.TG 
LKGR!jIKVTEBEI»KKHNKKD0CWICIRGPVYNVSPYMEYHPGGE 
DBIWRAAGSDGTBIiFDQVKllWVlfreSMLKECLVGRMAI KPAVLK 

dyrbeekkvi^gmlpksqvtdtlakegpsypsydwf 
/ehiy*tegyqfrlnns*sse* flysrnny*gllis ytyw/r*a 
mrfrkifixgl/cbsvgkieivi^kjcfjtowdfwmplkn™ 

rraKDTGLYYRKCQLISKEDVTHDT^ 

HVYLKLPITGTEIVKPYTPVSGSLLSBFKKPVLPNNKYIYFLIK 

XYPTCLPTPEIJJRIiQIGDFVSVSSPBGNFKISKFQEnEDLFIiIA 

AGTGFTPMVKJ:LHYALTDIPSLRKVKLMFFNKTB1)DIIWRSQI^ 

KtAFTOJKR^VBFVI^APISEWNGKQ^ 

SKVLVCICGPVPFTE<X3VRIiLHDIJ»FSKWEIHSFTA 


6948 


104 


58 


PDG^SEFPDEYFTCSSLCI^CGVGCKKSMNHGKEGVPHKAKSR 

CR YSllQYDNRVYTCKACTFJi GEEVS^ATPKTSASTDS PWMGLAKY 

AWSGYVI ECPWCGVVYRSRQYWFGNQDPVDTVVRTE I VHVWPGT 

IK5FIjia)llNNAAQRLLDGMNFMAQSVSBIiSLGPT 

APAYWRPNSQILSCNKC^TSFIOQNDTKHHCRACXJEGFCDSCSSK 

TRPVPERGWGPAPVRVCmCTEAR/TRPVSCYRGTSGR*RRRRT 

QETVB 


6949 


152 


4656 


GI^CLSRPLTRPGDDSVGGSAMASGAGGVGGGGGGKIRTRRai 
QGPIKPYQ(X5RQQH(^II^RVTESVKNIVPGWriQRYFNKNEDVC 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence ; 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A»Alanine, C=Cysteiue, D~Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H^Histidine, I»Isoleucine, K=Lysine, 
L=fceucine, M^Methionine, N=JYsparagine, 
P- Proline, Q=Glut amine, RiArginine, 
S=Serine, T-Thxeonine, V=Valine, 
W=Tryptopnan, Y=Tyrosine, X»Unknown, +=Stop 
Codon, /-possible nucleotide deletion, 
^possible nucleotide insertion) 








SCSTDTSKVPRWPENKBDHLVYADEESSNITDGRITPKPAVSOT 
EE PS TTSTAS T \ YPDVLTRVSLYRSHIiNFSMLES PALHCQPS TS 
SAPPIGSSGFSLVKEIKDSTSQHDDDNISTTS6FSSRASDKDIT 
VSWn^LPPI^SPEAERSHSIiSQm-ATSSiaCPAj^SAPGTLSP 
SLGNSS IIJCTSQLGDSPPYPGK7TT YGGAAAAVEQSKLIINTPYQA 

SIVSSPLNSPIiDI^IDITDFQAKR^KWSQYPPVQRIWPKPV 
S TATNRSVYPKPSLTPSGEFRKTNQRIDKKCSTGYEKNMTPGQN 
REQRESGFSYPNFSLPAANGLSSGVGGGGGKMRRERHAFVASKP 
LEEEEMEGPVLPKISIiP ITSSSLPTFNFSSPE ITTSSPSPINSS 
QM»TNKVQMTSPSSTGSPMP1CPSSPIVKSTEANVLPPSSIGFTP 
SVPVAKTAKI^GSSSTLEPIISSSAHH\nTVNSTNCKKTPPEDC 
EGPFRPAB1LKEGSVLDILKSPGFASPKID3VAAQPTATSPWY 
TOPAISSFSSSGIGFGESIJCAGSSWQCDTCIMNKVTD^ 

WDCDTCLVQHKPEAIKjCVACEro 

OTASSSSCTVTTGTLGPGDKFKRPIGSWECSVCC^ 

VSCKSBIO?GSSVPTSSSSTVPVSLPSGGSIjGLEKFKKPEGIWDC 

elclvqnkadstkojacssakpgtksgfkgfiyrsssssnsaass 
sfkfgvsssssgpsqtltstgnf^fgdqggfkigvssdsgyinp 
msegf* fskhivgfkfgvssbskpee vkkdskin>npkpgl>s fgi» 
snpvfl,tpfqfgvsmijgqeekkbkliik3sa^gfrfgtgvinstr 
vpantivt3enk3sfnlgtietksvsvaplkcqtseakkeempa 
tkggfsfgnvepasiipsasvfvlgrtebkqqepvtstslvfgeg 
kltmkepkc\qpvfsfgefqrqtkdensskstfsfsmtkpseke 
seqpaxatfafg ao/rrntadqgaakpdls ylnnss s s sstpats 
aggg\i fgsstsssmppvatfvfgqssbpgsss\afgntaesst 
sosllfsqdsklattss0x1tavtpfvfgpgassnntttsgfgfg 
attts s sags s fvfgtgpsaps as pafganqtptfgqsqgas q p 
nppgfgsissstalfptosqpapptfgtvssssqppvfgqqpsq 
safgsgttpnsssafxjfgssttnfnptnnspsgvftfgansstp 
aasaqpsgsggfpfnqspaaftvgsngknvfsssgtsfsgrki k 

TAVRRRK 








6950 


2585 


411 


PRPGSRSGLCRRAGERGAVRAGGLSRRTRAE* IMDELHYQDTDS 
DVPEQRPSKCKVKWTHEB^EQLRALVRQFGQQDWKP1»A^ 
TDQQCQYRWIjRVIjlJPDIfVKGPWraEEM 
IAKHLKGRLGKQCRERWIINHIiNPEVKI^ 

LGNRWAE IAKMLPGRTDNAVK1IHWSTIKRKVOTGGFLSBSKDC 
KPPVYXjI.T.ET tEDKDGLQSAQPTEGQGSIiIiTMWPSVPPTI KBKEN 
SEEBtAAATTSKEQEPIGTDLDAVHTPEPIiBEFPKREDQEGSPP 
ETSIiP YKWWEAANL L I PAVGSS I>SEAIJ)L I ESDPDAWCDLS K F 
!)IiPBEPSABDSINNSIiVQLOASHQQQVI*PPKQPSA\IiVPSVTBY 
IUJ)GHTISDI^RSSRGELIPISPSTBVGGSGIGTPPSVLKRQRK 
RRVALSPVTK^TSI^FLDSCNSLTPKSTPVKTLPFSPSQFTiNF 

VTPIXJKYSMDNTPHTPTPFKmLHKYGPLKPLPOTPHLBEDLKE 
VLRSBAGIELIIEDDIRPEKQKRKPGLRRSPIKKVRKSU\I>DIV 
DKtWKLmSTLPKSI^LPTTAPSNSSSLTLSGIKEDNSLLNQGF 
I^AKPEKAAVAQKPRSHFTTPAPMSSAWKTVACGGTRIXiLFMQB 
KARQLLGRLKPSHTSRTLILS 


6951 


1940 


239 


AGPDI^imPvSI^ALyC^I^FlJilliALTEALAFArQKpSPRESL 
a^PSGrPPGTMVTAPHSSTRHTSVVMLTPNPDGPPSQAAAPMA 
TPTPRAJ5GHPPT VFPSPPSLRQ* PPPILKAP/SSTGPAEAAMAT 
TSSKPBGPJRG^AAPTILIjTKPPGATSRPTTAPPRTTTRRPPRP 
PGSSRKSAGNSSRPVPPAPGGHSRSK^a^PJJPSSTPLGQKRP 
1X?KIFQIYKGNFTGSVEPEPSTI,TPRTPIAK^SSSPQPQTVAAT 
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SSQ 
IP 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
c orresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=JUanine, C= Cysteine, D^Aspartic Acid, E- 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
HsHistidine, I-Isoleucine , K=I»ysine, 
L»Leucine, ^Methionine, &=Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S^Serine, T«Threonine, V^Valine, 
W-Tryptophan , Y=*Tyrosine, X-Unknown , *=Stop 
Codon, /^possible nucleotide deletion, 
\«=poaoible nucleotide insertion) 








TVPSNTSWAPITrajXJPAKDKPGLRRAAQGGGSTFTSQGG™ 
TAASGAPVSP/PSCPSAFSAPPPR*PTGWPQP* *UAYCYp\CT 
S RPLS TS SGVFTAATG PTPAAFDTS VSAPSQGI PQGAS TTPQAP 
THPSRVSBSTISGAKBEriVA\PSP*PTGa»VI»SPQWYPQPQAIS 
SrAWSPPGPGSLGQQGTSPMWPRGTNRSTEPPSA*ARWISPG*S 
WPSACPSPP\I>CPADGVLHBBBEEDRQPGBQPEAYGNNTHHPGT 
TFQQACXRGAAPGEIPVPIoKPLRTQI^SRPRSPANGDYRDTGMVP 
C 


6952 


658 


304 


PESEGBSGEMTDRYTIHSQLEHLQSKYIGTNATPTPPSGSGNCE 
PTPRLVTiUjHGPLRP SQLLRHCGE* EQS AS P IQLDGKDASALWT 
ASRQARGELR^CLTTAVRGTSPSVSPVCQSS 


6953 


1512 


349 


NWGKTRALASGKHVPFGKQTNPNKS /VHCDS *G**RRBTTQDBS 
FS PHFRGKMGGW\KI J BKKLBNTKQPVGGNEG *EHEVTGNLNSD 
PIJ*EIiC^CPI>CQLIX»SREQLIAHVYQHTAAVVSAKSYM\ cpvc 
GRAIiSSPGSLGRHLIiIflSEDGRSNCAVCGARFTSFIATFNSEKLP 
B\TIiNMESI»PTVHNEGPSSA^ 

YRKI»I»H^GTPSVRKWAI*RRQNEPLEVRIjQRLBRERT7UCKSRRDN 
BTPREREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRIiKR 
AIBTPKKRQi^LIREREAKRI^RRLEKMDNMLRAQFGQDPSAMA 
AUU^EMNFFQLPVSGVBliDSQIiUaKMAFEEQNSSSra 


6954 


819 


1 


PPPPPIIPSHPRBAST*AG*KRSGDSECSPPVEQ*A*TRAAAQN 
* PQR*RWTBGNSPOASAVATPGQGASPAAPRCTP * PSRRHRRLP 
PGARPPAG*AAPAPTKPWLAGPASAPQPGAAPLSPPAPPLIRTR 
♦CAGAAARGRPRRDRS PRPRTPGGCS WSSPRTPPAVSASAQTPS 
DAG*AGGR*GQRQRPSTGR*PPGVGGAGRSHRREGTIPGNPHPR 
AS * RAGWQR* PGP/REWG-U *EPQGEEMSGPGGPGGAPPNQVGSS 
VMQAMSTG1 


6955 


1968 


782 


PPGRRQVRAQVAGAPVGHWGTRARQVKTGGRRRARRTMPFIrfX)D 
WRSPGWSWIKTEIX5WKRCESCSQKX»ERENfMCNISHSIIIaNSKD 
GEIFNNEEHBYASKKRKKDHPRNDTN^ 

KRHG YCTIjGEAFNKL DFSSAIQDIRR FNYVVKI*LQLIAKSQLTS 
LSGVAQKNYFNXLEKXVQKVLDDHHW 

/N*RSREVCISGKHQYLDLP I RNY6RIATTATGSSDD* ASE \NG 

LTLSDLPLHMLNNILYRFSDGWDIITLGQVTPTLYML»SEDRQLW 

KKT^YHFABKQFCRHLILSKKmiBWKLMYFALQKHYPAK^ 

GPII1HFCRHCSILFWKDSGHPCTAADPDSCFTPVSPOHFIDI1FK 

F 


6956 


8605 


3839 


QfXSTSIFASFTSPPVIX^^VI^DMSFDIj^GSDABOKEMBTQSS 
DFPPSLTQPAPDQSSTIQI»HPATSPAVSPTTSPAVSI.VVSPAAS 
PErSPKVCPAASTWSPAVFSWSPASSAVLPAVSIiEVPLTASV 
TS P KAS PVTS PAAAPPTASPANKDV3S FIiBTTAJDVBB ITGEGLT 
ASGSGDVMRRRXATPEEVRLPLQHGWRREVRIKKGSHRWQGETW 
YYG PCGKRHKQFPEVI KYkSRNWHS VRRKHFSFS PRMP VGDFF 
KERDITEGLQWVQIiSASElPSRIQAlTGKRGRPRNTEKARTKEV 
PKyKRGRGRPPKVKITEIJ^KTDNRPLKKLEAQKTIiNEEDKAKI 
AKSKKKMRQKVQRGK(_OTriQG0ARNKHKQKTK6LKQKEAKKK5 
KAKKKKGKTXOEKLKEKVKRKKKEKVKMXKKKRVrKAKPACKAP 
XTLATQRRLRERQRQQMILEEMK1GPTEDMCLTDHQPLPDFSRVP 
GLTLPSGAF SDCLT 1 VBFLH5FGRVLGFDP AKDVPSLGVLQEGL 
LCQGDSI^BVQDLLVRI*LKAALHDPGFPSYCQSliKILGEKVSEI 
PLTRDNVSE I LRCFLMAYGVE PAI1CDRI1RTQPFQAQP PQQ KAAV 
LAFLVHELNGSTIiI INEIDKTLESMSSYRKTTKWI VKGRLRRLKT 
VLAKRIGRSEVEMFX3PEECIGRRRSSRIMRVTSGMEEEEEEBSI 
AAVPGRRGRRDGEVDATASSI PELERQIEKLS KRQIiFFRKKDLH 
SSOMLRAVSLGQDRYRRRYWVLPYIJUSlFVEGTEGNriVPEEVIK 
KBTDSIJCVAAHASLNPALFSM KMELAGSNTTAS S PARARGR PRK 
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WO 01/53312 



PCT7US00/34263 



SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C- Cysteine, D=»Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I«l3oleucine, K=Lysine, 
I»=Leucine, M=Methionine, N^Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S»Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, X=0nJcnown, *-Stop 
Codon, /^possible nucleotide deletion, 
^possible nucleotide insertion) 








TKPGSMQPRHLKSPVRGQDSEQPQAQLQPEAQLHAPAQPQPQLQ 
LQLQSHKGFLEQEGS PDSLGQS QHDI>SQSAFLSWLS QTQSHS S L 

lsssvltpdsspgkldpapsqppebpbpdkaksspdpqalwfni 
saqmpcnaaptpppavskdqptpspqqiasskpmnrpsaanpcs 
pvqfsstpiiagiiapkrragdpgbmpqsftglgqpkrrgrppskf 
fkqmeqryltqbtaqpwpemcsg>^wwirdpb^u j dami.kaiihpr 
glrekalhkhlnkhrdpiiqkvclrpsadpifeprqlpafqegim 
swspkektyetdlavlqwvbblbqrv1msdi<jirgwtcps post 
redlaycehifsdsqrditwrgrgregiiapqrkttnp 

AAIiEQNVEREYLRKPLWPTHEnnO^KAIil^TP^ I S 
YEITPRIRVWRQTI^RCRSAAQVCLCLGQLRRSIAWEKSVNKVT 
CLVORKGI^DEFIiliCDGCDRG 
aJvaWECTCFTQKPGPPKR^^^ 

GRESPAAGPRYSEEGI>SPSKRRRI>3MRHHHSDLTPCEI ILMEME 
SHDAAWP FIjE P VNPRLVSG YRR 1 1 KNPMDFSTMREPJUCRGGYTS 
SEEF7VAnA3jLVFDNCQTFNEDDSEVGKAGHIMRRFFE\SRWKKF 
YO^KQGQSVROGRWGVTL?*HLPPTFQTKTCHFHLJ^ 
RYNPDF 


$957 


82 


3514 


HLIVAMPEPTK3CKENEVPAPAPPPEEPSKEKBAGTTPAKDWTIjV 
ETPPGEEQAXQHAN6QI>S I LF IEKPQGGTVKVGEDI TFIAKVKA 
BDLSEKPTIJKGSRKWMDIiASKAGKHLOLKBTFERK^RVY^ 
IIKAKDNFAGNYRCEVTYKDKFDSC^FDLEVHESTGrrPNIDlR 
SAFKRSGEGQEDAGELDFSGLLKRRBVKQQEEEPQVDWELI*KN 
TKPSEYEKIAFQYESPTCSGMLKRLKRSIREKKKSAAFAKILDP 
VYQVDKGGRVRFVVELAD P KLRVKWNKNGQELRP STKY I FEDTR 
C^SIIOTDNCQMTDDSEYYVTAGDEKCSTELLVREPPIMVTKQL 
EDTTDYCGERVELECEVSEDDAQVKWFKNGEE I ILVQTRYRIRV 
EGKKH ILI IBGATKAX1AADYSVMTTGG05SAKLS VDLKPI»KILT 
PLTDQTVNLGKEI CLKCB r SENI PGKWTKNGLPVQBSDRLKWH 
KGRIHKIiVIDHALTEDEGDYVFAPD AYNVTLPAKVHVIDP PKI I 
LDGUJADNTVTVtAGHKIJUUEIPISG 

SGRIRTBSYPDSSTLVIDIAERDDSGVYHINLKNEAGEAHASIK 
VKVVDFPDPPVAPTVTEVGDDWCIMNWEPPAYI)GGSPIliGYFIE 
RKKJCQSSRWMRU^LCKETTFEPKKM IEGVAYEVRI PAVNA\l 
GISKPSMPSRPPVPLAVTSPPTIJiTVDSVTITn^VT^ 
GAAGIJX3YVI^CFEX3STSAKO^BNGEAAYDIiPAEDWrVANKD 
LIDKTKPTITGLPTDAKI FVRVKAVNAAGASEPKYYSQPI bVKE 
IIBPPKJHSPKHIiKCyrYIRRVGDRVILVIPFQGKPRPELTWKKD 
GAEIDKNQINIRNSETDTIIFIRKAERBHSGK^DLQVKVDKFVE 
TASlDIRIIDRPGPPQrVKIEDVWGRWALTWTPPimDGNAAlT 
GYTIQKADKRSME WLRVIEHI IEPVPHTEIA7IGNEYYFRVFSEN 
MCX3LSEDATMTKE SAVIARDGKI YKNP VYEDFDFS BAPMFTQPL 
VNRLCHSGYMATIiNCSVRCT^KPKI TWMKNKVAI VDDPR YRMFS 
NQGVCTLEIRKTSPYDGGT^CCKAVNDLGTVEIECKLEVKVIA^ 


6958 


274 


1663 


PRTSRVKTEGSQGSSAMDFSVKVDIEKEVTCPICLELLTEPJUSL 
DCGHS FCQAC I TAKI KESVI ISRGESSCPVCQTRFQP<3NLRPNR 
HLAN1 VERVKEVIOTSPOEGQKRDVCETmGKKLQI FCKEDGKVIC 
WVOSLSQEHQGK^FRINEWKECXJEKI^VAI^RLIKENQEAEK 
I^DDIRQERTAWKNYIQIBRQKJLKGFNEMRVILDNEEQRBLQK 
I^E^EVNVLDNLAAATDQIjVOQRQEtAS TLX S DLQRRLRGS SVEM 
WJDVIDVMKRSESWTIJa(PKSVSKKLKSVFRVPDI^GMLQVI*KE 
LTDVQYYWVDVMLNPGSATSNVAISV^ROVKTVRTCTFKNSKP 
CDFSAFGVFGCQ YFS SGKYYWEVDVSGKIAWIIjGVHSKISSLWK 
RKSSGFAFDPSVNYSKVYSRYRPQYGYWVIGLQHTCEYNAFEDS 
SSSDPKVLTI»FMAV\ LPWLGFS 


6959 


1 


1469 


SLVHWEFGRGIBDFPYLFF0LTHC3QQR ICSVTGAG VQWCDE^S 
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WO 01/53312 



PCT/US0O/3424S3 



SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, (^Cysteine , D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine f N-Asparagine, 
P^ Proline, Q=Glutamine, R=Axginine, 
S=Serine , T=Threonine , V»Val ine , 
WsTryptophan, Y=Tyrosine, X=Unknown, *»Stop 
Codon, /^possible nucleotide deletion, 
V=possible nucleotide insertion) 








LQPQTPGLNQSStn^LLSSRDYRMIiSSFNEWFWQDRFWLPPNVT 
WTBLEDRIX5RVYPHPQDLLAALPLALVLI^RLAFERPIGLPLS 
RWLGVRIX)TRRQVKPNA1XEKHFLTEGHR 
TI »QQTQRW FRRRR^QDRPQI*TK3CFCEASWRPIjPYLSS fvgglsv 
LYHESWLWAPVMCWDRYPNQLTLS CPAADSEA\SLYWWYIjLELG 
FYLSLL I RLPFDVKRKGGG P SSI KPRPHYDPPSTA\ DFKEQVTH 
HPVAVIIiMTFS YSANKLRIGSL VLIiLHDS SDYLIiKACKMVNYMQ 
YQQVCDAIiFLI PBEVFTYTRLVIiFPTQInYTTYYES ISMRGPFF 
G YYPFNGbliMLLQIiLHVFWS CL I LRMIiYS FMKKGQMEKDI RSDV 

eesdssebaaaaqbplqi^gtaggprj?aptdgprsrvagrl™ 

RHTTAT 


6960 


387 


2068 


AKWARHKEMQEF\TRSFF\RGRPDLSTLTHSIVRKRY1AHSGRS 
iiLKPKEKQALK^LVEKEPLKMQVDKAy\SRKDKU)LTKKGKRPPT 
PCSDPERKRFRPN3 E S ESG3KAS S PDYFGP PAKNGVAS RSHTHP 
KEKNPRRA \ 3 KAVEE S S DE KRQRDLPAORG KESS E EE K KG YKG K 
TRKKPWKJXiAPGKASVSRKQAREKSEESEAEPVQRT^ . 
KGTKS JjKESEQESEEE ILAQKKEQREEE VEEEKKEKDEEKGDWK 
PRTREt«3RR5CSAREERSCKQKSQAKRLIiGDSDSBEEQKEAASSG 
DDSGRDREPPVQRKSEDRTQIiKGGKRLSGSSKDEEDSGKGEPlA 
KGSRKMARIX5STSGEESDLHRBVSDSEAGGGPQGERKNRS3KKS 
SRKGRTRSSSSSSDGSPEAKGGKAGSGRRGEDHPAVMRLKRYIR 
ACGAIIRNYKKLLGSCCSHKERLS I IiRAEUSALGMKGTPS I/JKCR 
AEKEQRF^AAEVASLDVANI ISGSGRPRRRTAWNPLGEAAPPGE 
IiYRRTI*DSPEERPRPAPPDWSHMRGI ISSDGESN 


6961 


340 


1646 


RPWS3 PTMKPNFSLRLR I FNLNCWGI PYIiSKHRADRMRRJEGDFIi 
NQESPDLALLE EVWSBODFQ YLRQKLS PTYPAAHHFRSGX IGSG 
LCVFS KHPI QELTQHI YTliNGYP YM I HHGDW FSGKAVGLLVIjH L 
SGMVI^YVTHI^EYNRQKDIYIJUmVAQAWFJ^ 
ADVVLLCGDLNMR^FJ)IAjCCIi 

im«VPKNCYVSQQELKPFPFGVRIDYVLYKAVSGFYISCKSFE'r 
TTGFDPHRGTPI^DHFAI^TIjFVRHSPPQO^PSSTHGPVAERS 
PL/MCVCI J KEAIJX^IX3W^\0ARWWA\TFA\SYVIGI^L\LL 

iallcveaagggageaai I.LWTPS vglvewagafylfhvqevng 

LYRAOAKI^HVLGRAREAQDLGPEPQLYAI*L\IjGQQEGDRTKEQ 


6962 


340 


1646 


RPWSSPTMKPNFSI^UJilFm.NC^IPYIiSKnRADRMRR^ 

NQES FT>IJU^F^SFX}DFQ YliRQKI^PTYPAAHHFRSGt IGSG 

LCVFSKHPIQELTQHTriXNGYPYMlHHGDWFSGKAVGI^ 

SGMVI^YVTHLHABYNRQKDrYU^VAQAWELAQFIHHTS^ 

ADVVI*I*CGDIjNMHPBDIiGCCliIjKEWTGI^^ 

^^^MVPKNCYVS^QE1,KPFPFGWIDYVLYEAVSGFYTSCKSFET 

TTGFDPHRGTPI^DHEALK^TLFVRHS P PQQNPS STHGP \AERS 

PL/MCVCUCFALIX^IiGIX5MA\OARWWA\TPA\SYVIGI/3L\l^ 

LAIiCVIAAGGGAGFAAILLWTPSVGLVLV^ 

LYRAQAELQHVLGRAREAQOLGPE PQLYATjI>\LGQOEGDRTKEQ 


6963 


374 


2618 


RVJPL 1 LiKIiLKKP KTAEKQKASEENE I TQPGGS SAKPGL PCLNF 
BAVLS PDPAJjIHSTHSIj TWSHAH TGi>i>ULUlok,KL>PTi. i£KXH£> Hi 
LHNFSNSVLETLNEQRNRGHFCDVTVR IHGSMLRAQRCVLARGS 
PFFQDKLLLGYSDIEIPSVVSVQSVQKLXDFMYSGVLRVSQSEA 
LQII/FAAS HjQIKTVIDECTRIVSQNVGDVFPG IQD3GQDTPRG 
TPESGTSXsO^SDTESGYLQSHPQHSVDRXYSALYACSMQNGSGE 
RSFYSGAVVSHHETALGLPRDHHMEDPSWITRIHERSQQMERYL 
STTPBTTHCPJCQPRPVRIQTLVGNIHIKQEMEDDYDYYGQQRVQ 
ILERlJBSBECn , BDn)OJu?GTBSEPKGESFDSGVSS S IGTEPDS V 
ECWFGPGAARDSQABPTQPEQAAFJVPAEGGPO/rNQLETGASSPE 
RSNBVEMDSIVITVSNSSDKSVLQQPSVNTSIGQPLPSTQLYLR 
QTETLTSNI*RMPLTLTSNTQVIGTAGNTYI*PALFTTQPAGSGPK 
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SEQ 
ID 

NO; 


Predicted 
beginning 
nucleotide 
location, 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cyateine, D=Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine, <3*Glycine, 
H=His t idine , I ^Isoleucine , K=Lysine , 
L=I»eucine, M-Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S ^Serine, T=Threonine, Valine, 
W»Tryptophan, YVTyrosine, X=Unknown, *=Stop 
Codon, /«possible nucleotide deletion, 
\=>possible nucleotide insertion) 








PFLFSLPQPIAGOQTQFVWSQPGLSTFTAQLPAPQPLASSAGH 
STASGQGSKKPYECTLCNKTFTAKQl^KHMFVH^ I 
CWRSPSLKDYLIK\HMVTHTGVimYQCSICNKRPTQKS 
RI^GEKSYBCYICKKKFSHKTLI^RHVALHSASl^ 
GARAGPPGVVACTEGTreVCS VCPAKFDQI EQFND^^ 


6964 


1 


178* 


SGRPFPFFFSNTDVYF IICJCVTJJRI^AGSSYKMTRMKSIGKIIiLIj 
QIFIG\NCSMFVLVI 


6965 


757 


208 


NVFIEPRIQGFMKTSAHPGQKHPDPSMGLbFPIiLAAIJEVCSCGS 
SGS LG YNLPQNH \Gr.LGRimjVLLGQMRRISPFLCLKDitS DFR F 
PQKKVF^/SQLQKA\QAMSFLYWLOX)VFNF^ 
PGPTPHFTSSAAGTPGDLLGAGIX3PJIRS WG0>JV X EGSTLALRRY 
FQESISTLB 


6966 


820 


1867 


I ITAI/?VRGM PGCP^PGCX?MA<?PR &rv*qQp 
ALRSRGTATACRLDNKBSBSWGMiLSGERLDW 
liSGVFPLIiVI PLB^TMLHSEAGAKRIiKQItLSFAl£GIJX39VFI> 
HIJJPFAWAYTCSASPGGEGQSLOXXXX^^L^IAGirjTFIxALKK 
/HVPGQQGGGDQPGPCXiRPHCGCRRAQWRPLSGPAGCIUyiPR 
GP\DI KVSGYrjnJ^n-IDNFTHGLAVAASFLVSKKIGI^TTMA 
ILLIIK I PHEVGDFAIIiLPJVGFDRWSAAKLQLS TAIiGGLLGAG PA 
ICTQ3 PKGVBETAAWVLPFTSGGFLYXAXiVN^ 


6967 


162 


633 


GFIiPFKYWILDLSASSRMETEJO^PMELSSMSGFBBGSBIJfGFEG 
TDMKDMRIiEAKAWNDVLFAVNNM FVS KSI^CADDVAY INVKTK 
ERNRYCI^TRAGLKWG YAFDQVDDHLG/TC^ 
SPAYREAFGKR\LLQRLEAI>KRDGQS 


6968 


1 


2265 


RGGGGGRGGPGARERERPGEPERTMEAAAGGRGCFQPHPGEQKT 

L EQ FHLSS^S LGGPAAFS ARWAQEAYKKESAKSAGAAAVPAPV 

PAATEPPPVIiHLPAIQPPPPVI»PGpFFMPSDRSTER(^TVI»EGE 

TISCFVVGGElO^CLPQIIiNSVLRDFSLQQIN^^ 

CTADQLKILKVMGIIiPFSAPSCGLITKTDAERLCNAIJjYGGAYP 

PPCKIG^AASIiAljGLBLSERSVR\rYHK\CFGKCKGIj\LVPFJjYS 

SPSAACIQ<;]UD\CREMYPPHKFVVHSHKAU?^^ 

NWRAYI IJjSQDYTGKEBQARLGR \ CIJ5DVKBKFDYGNKYKKRVP 

RVSSBPPASIRPKTDDTSSQSPAPSBKDKPSSVR»RTLAGSSNKS 

IiGCTVHPRQRLS AFRPWSPAVS ASKKBLS PHLPAL I RDSFYS YKS 

FBTAVAPNVAIAPPAQQKVVSSPPCAAAVSRAPEpLATCTQPRK 

rklitvdtpgapbti^pvaapkkdkdsfjywevesrf^ftsslss 

ls s ps ftss s s akdlgs pgaralps av pdaaa padapsgleae l 

ehlrqaleggldtkeake kflhe wkmrvkqke klsaai.qakrs 

lhqfxeflrvakkekxireateakrl^ 

nbsrlrjbkr^lbg^^^vcdkgceagriarakysaqiedlqvkl 

qimeadrbqlradijlrrrearbhlbk\vvk\ei^eqlwprarpe 

aagsegVaablbp 


6969 


1855 


118 


AGTMHGRLKVKTSEEQAEAKPJ^EREQKIiKLYQSAl'QAVFQKROA 

GBLDESVLELTSQILGANPDFATLWNCRREVIjQQLETQKS peel 

AAL\nCAEIX5FLBSCLRVNPKSYGTVmHRCWLLGRliPEPW 

ELCARFU^EPJ^HCWDYRRFVATQAAVPPAEEIiAFTDSLITR 

NFSNYSSWHYRSCLLpQIjHPQPDSGPOXSRLPBDVLLK^ 

AFFTDPNDQSAWFYHRWIiLGRADPQDALRCLHVSRDEACLTVSF 

SRPLLVGSRhffilt^LMVDDSPLIVEWRTPDGRNRPSHVWLCDLP 

AASLNDOJ^PQHTFRVIWTAGDVQKECVLIaKGRQEGWCRDSTl'DE 

QLFRCKLSVEKSWL<}SET^SCKEIjQEI^ 

RALDPLLYEKETI^YFOTLKVAWDPKRATY\LDDLRSKFTxEENS 

VLKMEYAEWVUflJ^KDLWI^LKQLIJ J VTIILDLSIINRLRTL 

PPALAALRCLEDPPPRT\VIiQASDNAI ESUX3VTNLPRLQELLI, 

CNHRI^PAVI^PI^CPRI.VriNLQGNPLCGA 

PSVSSVLT 
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NO: 
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beginning 
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location 
corr e sponding 
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amino acid 
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amino acid 
sequence 


Predicted end 
nucleotide 
location 
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to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F- Phenyl alanine , G=Glycine, 
H=Histidine, I-lsoleucine, K=I»ysine, 
Ii=I*eucine, M-Methionine , N=Asparagine , 
P= Proline, Q^Glut amine , R=Axginine, 
S eS er ine , T=Thr eonine , V» Val ine , 
W=Tryptophan, Y=Tyrosine, X«Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 


6970 


3 


152B 


SFPPLlASSPSAVGEGKVAVAAPCTORSECARAKMAyiQIiBPLNE 
GFLSRISGLLLCRWTCRHCCQKCYESSCCQSSEDEVKILGPFPA 
U-IPPWI^SRSSDKDGDSVHTASKVPLTPRTNSPDGRRSSSDTS 
KSTYSLTRR I SSLBSRRPSSPLI DI KPIEFGVLSAKKEPIOPSV 
LRRTYNPDDYFRKFEPHLYSLDSNSDDVDSLTDEEILSKYQljGM 
UIFSTQYDLIJINHLTVRVI BARDLPPPISHDGSRQDMAHSNPYV 
KICIJ^PDQKNSKQTGVKRKlXJKPVFE^YTFEIPFIiEAQRRlXL 
LTVVDFDKFSRHCVIGKVSVPLCEVDI,VKGGHmJKALIPSSC3^ 
VFJASEI^t^IJfnrLPSAGRIiNVDVIRAKX} 

QLVHGI^VKTKiCTSFI^GTIDPFYNESFSFKVPQEELENASLV 
PWFGHN^ntSSlroFIGRIVIG\QYSSGP\SEPm^MI«lMIJm^RT 
AVEQWHSLRSRAECDRVSPASLEVT 


6971 


37 


3702 


ACF YVPGSRS FKLI PR HGI*VNMGRSG KLPSG VSAKLKR WKKGHS 
SDSNPAICOTROAARSRFFSRPSGRSDLTVIlAVKLHNEIiQSGSL 
RLGKSEAPETPHEEEAEI»VLTEKSSGTFI»SGI»SIK!TNVTFSKVQ 
RFWBSNSAAHKKICAVLAAVTRVIRSQGGKETETEYFAALIRKA 
AQHGVCSVLKGSEFMFEKAPAHHPAAISTAKFCIQEIEKSGGSK 
EATTTLHMLTLLKDI*LPCFPEGLVKS cs etllrvmtlshvlvt a 
CAMQAFHSLFHARPGLSTLSAELNAQIITALYDYVPSENDLQPI* 
IAWUCVMEKAHINI.VRLQWDr/;i^ 

LTAATQSLKE ILKECVAPIIMADIGS VTSSASGPAQSVAKM FRAV 

EEGLTYKEHAAHSSVLQLIiCVFFEACGRQnHPVMRiaMSI^Ii 

RX^PHTPHTAAIiDOAVGAAVTSMGPEVVLOAVPIiEIDGSBBT^ 

FPRSWIiLPVIRDHV^ETRXjGFFTTYPLPIANT^ 

STVES KI YDTUlHQMWTLLPGFCniPTDVAI SPKGI^TLGPIAI 

SERPDr^VTVCQAJ^TLITKGOQAEADRAKVSRFAKNFLPILFN 

LYGQPVAAGDTPAPRRAVLETIRTYLTITDTQLVNSLLEKASEK 

VIjDPAS SDFTRX*S VLDLWALAP CADEAAIS KL YST IRP YLES K 

AHGVQlQCAYRVIiEEVCASPQGPGALPVQSHIjBDLKKriiLDSLRS 

TSS PAKRPRIJCCIiEiHIVRKLSAEHKE FITAL IPE VI LCTKK VSV 

GARKNAFAIXVEMGHAFLRFGSNOKE ALQCY LVL IYPGLVGAVT 

MVSCS II^THLbFEFKGIiMGTSTVEQLLENVCLUiASRTRDVV 

KSAICFIKVAVTVMDVAHLAKHVQLVICEM 

LRNLFT\KFIPK\FGILTWGKKAVGPKEYHRVI 1 WIRKARARAK 

RHRALSQAAVBEREEEEEEBEPAQGKGDSIBBILADSBDBEDNE 

EEERSRGKi^RKLARQRSRAWLR^GGGDEPUfFljDPKVAQRVlA 

TQPGPGRGRKKDHS FKVSADGRLI IREEADGNKMBEEBGAKGED 

EEMADPMEDVIIRNKKHQKIiKHQKEABEBEIiEIpPQyQAGGSGI 

HRPVAKKAMPGAEYKAKKAKGDVKKKGRPDPYAYI PLNRSKIJJR 

RXKMKLQGQFKGI*VKAAQRGSQVGHKNRRKDRRP 


6972 


2179 


973 


PGGAILLPLWRRTRPREATVPRGAAQRGRARSAEGRI PSSQSPS 

PAEAGGATRSPPPRPPRPARPPGPSAPPIJJlSDAGPGATVsAAA 

AAATBTlARRGATMGAQLSTIiGHMVliFPWFLYSLLM 

AITLBS PDIRYPIiRIilDRBI ISHDTItf*FRFAIiPSPQHIIX3liPVG 

QHIYLSARIIX^VVRPYTPISSDDDKGFVDLVIKVYFKDTIIPK 

FPAGGKMSQYLESMQIGDTIEFRGPSGLI.VYQGKGKFAXRPDKK 

SNPII RTVKSVGM IAGGTG I TPMIiQ VI RAIMKDPDDHTVCKLL p 

ANQTElO)ILia?PELEKLRNKHSARFKLWYTI.DRAPEA*^ 

FVNEBMIRDHLPPPE\EEPLVLMCX3PPPMIQYACLPNL\DHVGH 

PTBRCFVF 


6973 


1 


1964 


IKJPRCMRGLRAQKOSRPAPGVDAMVLCPVIGKI^ 

SPRRQEILSNAGIiRFEVVPSKFKEKLDKASFATPYGYAMBTAKQ 

KALEVANRLYQKDLRAPDWIGADTIVTVI^LIIjEKP 

RMLSRFE /SGRE1ISVFTX5VAIVHCSSKDHQLDTRVSEFYEBTKV 

KFSE1^EEIXWEYVHSGEPMDKAGGYGIQAIX5GMLVESVHGDFL 

NWGFTLmFCKQLVKLYYPPRPEDLRRSVKHDSIPAADTFFJDL 
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SEO 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re sponding 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A»Alanine, C= Cysteine, D^Aspartic Acid, E= - 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Hietidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, Ns*Asparagine . 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonlne, V=»Valine, 
W=Tryptophan, Y«Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








SDVEGGGSEPTQRDAGSRDBKAEAGEAGQATAKABCHRTRETLP 
P FPTRLLELIEGFMLSKGLIiTACKLKVFDLLKDEAPQKAADIAS 
KVDASACGMERLLDI CAAMGLLKKTEQG YSNTETANVYLASDGE 
YSLHGFIMHNNDLTWNLF/TYI*^ 

QDAY YOS PETRT.R FMRAMHGMTKr.TACOVATARMT^ ft I?q q A mv 
GGCTGALARELAREYPRMQVTVFDIiPD 1 1 BIAAHFQPPGPQAVQ 
IHEW^DFFRDPLPSAELYVLCRILHDWPDDKVHKLLSRVAESC 
KPGAGIiliLVETLLDBKKRVAQRALMQSLNMLVQTEGKERSriGEY 
OCLIiELHGFHOVO WHTifiGVLD A T T.\ PPmiPPRAHA Af 1 *;!". 


6974 


3082 


2172 


RSCAAFASFASRPPLBLFAPPGSHRSPPGRGVATSAQOU^VRK 
LLAAP^IXSTKYQATMVYKTLFALCILTAGWRVQSIiPTSAPLSV 
S LPTNIVP PTTIWTSS PQNTDADTAS PSNGTHNNSVLPVTASAP 
TSLLPKNISIESREEEITSPGSNWEGTNTDPSPSGFSSTSGGVH 
LTTTI^BHSIiGTPEAGVAATLSOSAABPPTLISPOAPASSPSSL 

PTSHATABPVPQKKTPPTTVSGKVMCELIDMET\PPPFPG 


6975 


2 


500 


RPRPTVHCCKWALKI^rAMETI,IlWPHAHSGKEGDKYXI^K^ 
KELLQTELSGFIJJVKELML*ATEAIiKTFEEA + KSPI IQCSSSRS 
SLPPATQPPPYL^LSAVPFPIHLPLPLLPPQAQKDVnAVDKVMK 
BLDENGIX}EVDFQBYVV1#YAAI,TVACNNFFWBNS 


6976 


1216 


970 


(Z(T>T»'*\7RVfl'TT'T?KIQ U V7*P VAH PPl?TVT T \rt7<*MrBTr'g\y/T>TfWlPT-i 'PURL) T 

^MEAGTLAKLNTPGELCIRGYC^IJGYWGRPQKTESAVDQDKW 
YWPGDVATMNFOGFriCT VGR *lKTWITTRGnPNT VPflRT.l?nJ?J7HTH 

PKVQEVQ WGVKDDRMGEE I CAC I RLKDGRETTVEEI KAFCKGK 
ISHFKIPKYIVFVTNYPIiTISGKIQKFKLREQMERHLNL* IKQQ 
ACPGRLA 


" 6977 


1298 


588 


SLFINTNLLSNQIRKTSFGMCSEPISDNTKDQKGKLKTPDFA*R 

ANKKSKHHVNGNRTVBPFPEGTQMAVFG^K3CFWGAKRKFWVIiKG 

VYSTQVGFAGGYTSNPO^KEVCSKKTGHAEVVRVVYQPEHMSFE 

KJ^KVFWENHDPTOGMRQGNDHGTQ^ 

KEOTQKVI^BHGFGPITTDIRBGffrc^ 

CGLGGTGVSCPVGIKK 


6978 


3 


242 


SFPFRDSRRCCCCKGSSLRHTAVAMVKLSKBAXQRLQQLFKGSQ 
FAIRWGFI PLVT YLG FKRGADPGMPEPTVLSIjLWG 


6979 


3917 


1146 


DEAR VRGEA VAAAIJjSR CRHWSGPPPFPPSPPDKKGLRGTE P WE 
AGWSGATrcARAMDVRRLKVNELREKr^RRGIJ^TRGLKTFrrJ^ 

rlo^valeaeepddereij>addepgrpghineevetegg^eiiegt 
aoppppglqphaepggysgpz>otamdnitrq^qfydtqvikqe 
nesgybrrplb^b2qoxjayrp^ktemkogaptsfijppe7vsqilkp 
drqqfqsrfj*pybenrgrgyfehredrrgrspqppaeededdfd 

DTLVATDTYNC3>LHFKVARDRS SGYPLTIEGFAYIj WSGARAS YG 
VRRGRVCFEMKINEEISVKHLPSTEPDPHWRIGWSLDSCSTQL^ 
GEEPFSYGYGGTGKKSTOSREENYGDKFAENDVIGCFADFECGN 
DVELS FTKWGKWMGIAFR IQKEAI/5GOALYPHVLVKNTCAVEFNF 
GQRAEPYCSVLPGFTFIQHLPLSERIRGTVGPKSKABCEILMMV 
GIjPAAGKTTWAIKHATVSNPS KKYNILGTNAIMDKMRVMGLRRQR 
NYAGRWD\^IO^TGJ3JmiaQI^^ 

RKMRPFEGFQRXAIVICPTDE1DLKDRTIKRTDEEGKDVPDHAVL 

EMKANFTLPDVGDFI^EVLFIEIXJREEADKLVRQYNEEGRKAGP 

PPEKRFDNRGGGGFRGRGGGGGFQRYKNRGPPGGNRGGFQNRGG 

GSGGGGMYRGGFNRSGGGGYS^RWGNNNRDJJNNSNNRGSYNRA 

POXXJPPPO^PPPPQPPPQQPPPPPSYSPAPJfc^PGASTYNKNSNI 

PGSSANTSTPTVSSYSPPQSFGFFPSTFO.PSYSQPP 

GxTT^PPPPPPPPAYNYGSYGGYl^APYTPPPPPTAQTYPQPSY 

NQYQQYAQQWNQYYQNQGQWPPYYGNYDYGSYSGNTQGGTSTQ 


6980 


1 


420 


GTRGRKTGRVAAPSTRRRTGNMQKLQTRS PAMSLS DPGLGYHPT 
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ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
bo first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A^Alanine. CwCysteine D=>Asnairtic Acid 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, I=»l3oleucine , KeLysine, 
I^Leucine, M=Methionine, N=*Asparagine, 
P=Proline, Q=Glutamine, R==Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y«Tyrosine, X-Unknown. *«Stop 
Codon, /-possible nucleotide deletion, 
\»possible nucleotide insertion) 








OTTLRWPPLCSUIALHVFHCLFS5RLGTPVSPRLAMDPNCSCKA 








SEKCSCCA 


6981 


10 


1054 


PGRGPRRASI^PAPAARGVFQGGLGQAKQARTRACAAIjPTPHPS 
APRLLEPO<OTSLPPPPPGPWPlWILTKAQYDKIAG<:iiV5VPPT 
HQSLRKLKQRFPSQSQATIJiSIFSQByQKHIKRTHAKHHTSKAI 
E S YYQRYLNGVVKNGAAPVIiLDIiftNBVD^ LERFLQ 
BHEETPPSKSI INSMLRDPSQI PDGVLAIf QVYQCIVNDCCYGPL 

ILQVPVAVBGHI IHW I ESKASFXJDECSHHAYLHDQFW S YWNRPG 
PGLVIYWYGPIQBIJJCNRERGIIXKACPTTNIVTLCHS IA 


6982 


153 


1285 


FPO^DCSAPAAPGLAGSEPRRLRAYRRRKQRARGIiKRVAWIiAPP 
PSIJXX?IiQGWAQAPVt)3TLGPEDSRASSPMIQNSRPSLLQPQDV 
GI^IVETLMLHPVTKAFLCGSISGTCSTLLFQ^ 
PSDHGSRRVGMLAVLLKVVRTESLLGLWKGMSPSIVRCVPGVGI 
YFGTLYSLKQYFLRGHPPTALBSVMLGVGSRSVAGVC^ PITVI 
KTRYBSGKYGYES1YAALRSIYHSEGHRGLFSGLTATLLRDAPP 
SGIYIJMFYKQTKNIVPHDQVDATLIPITNFSCGIPAG1IASLVT 
QPADVI KTHMQLYPLKFQWIGOAVTLI FKDYGLRGPFQGGI PRA 


' 6983 


82 


773 


EM S FLQDPS PFTMGMWS IGAGAIX^VAALAIjIJjANTDVPLS KPQ K 
AALEYIiEDIDIiKTI^KEPRTPKAI^WEKNGAVTMAVRRPGCFL 
CREEAADI>SSLKSMLDQLGWLYAVVKEHIRTEVKDFQPYPKGB 
IFLDEKKKFYGPQRRKMMFM3FIRI»GVWYNFFRAWNGGFSG1^E 

KPQTLASEKK 


6984 


1845 


1282 


GGRSAYSLPAGSbPRVPATAAAKMASGVQVADBVCRIPYDMKVR 
KC3TPKEIKKJUCKAVIFCLSADKKCI rVEEGKE riiVGDVGVTI T 
DPFKHFVGMLPEKDCRYALYDAS FETKBSRKEELMPFLWAPELA 
PLKSKMIYASSKDAIKKKPQGIKHKCQWraPElDlJmACIAEK^ 
GSLIVAPEGCPV 


69B5 


1887 ! 


1324 


RRTAGI YPCFPKPGRTRHAIjC2SVVI.L1jIjTGQIAFDD FQES CAMM 
WQKYAGSRRSMPLGARILFHGVFYAGGPAIVYYLIQKPHSRALY 
YKIAVEQLQSHPEAQEAI^PPLNIHYL^ 

IPVSGSKSEGLLYVHSSRGGPFQRWHLDEVFLEIiKDGQQIPVFK 
LSGEKGDEVKKB 


6986 


642 


1350 


YHLYFKMGDPNSRKKQALNRJjRAQLRlGnCES^ 
VPKEKKKKSALFEVSEVI PVMTNNYEENTLKGVRDSS YSLESSL 
EIMKDVVQUlAPRYQSMRRIWlGCrQBMDFlI^ 
LLFSRWKESDBPFRPVQAKFEFHHGDYEKQFLHVLSRKDKTG IV 
VNNPNQSVFLFIDRQHLQTPKNKATI PKLCSICLYLPQEQLTHW 
AVGTIEDHLRPYMPE 


6987 


1623 


341 


LEAAEKASRAPKE3QRQTDSKNYETEWWSPQKSQRRYBMYNTAC 
PU5BlEVGLYTIQII^LTPPPHI0amiSiaiHMVQPI,SGKWTI PP 
DPRNBCYUU^SKFTSHLKNI^SDI^CFDFFIDYKVOiLK>lRYTQ 
KEIABIMI^KKVSRCPT*KYTELFCHI4)PC^ 
RKKLBALRADRFAGLLEYLNPNYKEA 

KKPMTNE KQNS ILAN1 IJLSCLKPNSKLIQPT^TTLKKQLREVLQF 
VGl^SHQYPGPYFLACLLFWPENQEIJXJDSiajrEKYVSSrJ^RSFR 
GQYKRMCRS KQAS TI1FYLGKRKGI1NS I VHKAKIEQY FDKAQNTN 
SLWHSGDVWiCKNEVKDLLRRLTGOAEGKLISVEYGTEEKXKIPV 
I S VYS GPLRS GRNI ERVSFYLGFS IEGPPGL 


6988 


3 


689 


TQLLRRPAVPVGSAASGIRSGLWSASSGEWCAPAAGRAHAPVPR 
LWGLGAASTAAPQDAQTGPQPMPRADC I MRHLPYFCRGQWRG 
FGRGSKQLGI PTANPPEQVVDNLPADISTGIYYGWA5VGSGDVH 
KMWSIGWNPYYKNTKKSMETHIMHTFKEDFYGR.ILNYAIVGYL 
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Amino acid segment containing signal peptide 
(A=Alanine r C-Cysteine, D^Aspartic Acid, E= 
"Glutamic Acid, ^Phenylalanine , G=Glycine, 
H=Histidine, I-Isoleucine, K^Lysine , 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threon±ne , V*=Valiue, 
WaTryptophan, Y=Tyrosine, X=»tTnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\sapossible nucleotide insertion) 








RPBKNFDSLBSLISAIQGDlEEAKKRLEIiPEHtiKIKBDNFFQVS 
KSKIMNGH 


6989 


2 ■ 


1110 


LMPSDKPLS PSTHASAGSHCHAPPTTAJRRAFPIPFGSKSNMATL 
KDQLI YTlLLKEEQTPQrnCITWGVGAVGWACAI 5 ILMKDLADEL 
ALVD VI EDIO^KGEMMDIXJHGS LFLRTPKI VSGKD YNVTANSKLV 
I ITAG^QQEGESPJxNLVQKlWaPKPI IPZJVVKYSPNCKLLI V 
SNFVDILTYVAWKI SGFPKNR VIGSGCNLDS ARFRYLMGBPJjGV 
HPkSCTGWVLGEHGDSSVPVWSGMNV^ 

EOHKKVHKQWESAYEVTKLKGYTSWAIGLS VADLABS IMKNLR 

RVHPVSTmiOSLYGIKDDVFI^VPCILG^ 

KARLKKSADTIWGIQKELQF 


6990 


719 


258 


THASGMASWLALRTRTAVTSIJbSPTPATALAVRYASKKSGGSS 
KNLGGKSSGRRQGIKKMKGHYVHAGNI IATQRHFKWHPOAIIVGV 
GKNKCLYALEEGIVRYTKEVYVPHPRNTEAVDI*ITRLPKGAVLY 
KTFVHWPAKPBGTFKLVAMI, 


6991 


169 


■ 451 


RRSSDFHNPGFI^RPVSIjmNIHHQ\ncSTKNKRRNPKKIAYLL 

SSLLMTNLNPNESTENQPVI^YWAFT^ 

CGRHVH 


6992 


944 


510 


RQAPGCSSIAmQVRQVYOGLVRAPQVQTRPLSSRFVKRRGALY 
RSP^QSMPPPYPGPGPTAPYPPYPPQPMGPGPMGGPYPPPQGY 
PYQGYPQYGWQGGPQEPPKTTVYVVEDQRRDEliGPSTCLTACWT 
ALCCCOiWDMLT 


6993 


1 


374 


QWCVTCPQHNARQGPAVPPGI(jAYGAAPFKDLOVDFTEMSKCRG 

DRVMIIOTWNVASLCTI.WKGPQTVVI^PPTAVKVEGIPAWIH^ 

VKPAARBTWEARPSPDNPFRVTLKKTTSPAPVTPGS 


6994 


346 


1100 


QWPEKDPVMAASSISSPWGKOTFKA^LMVLVAfc 
RDFAPPGQQKREAPVDVLTQI GRS VRGTIiDAWX GPETMHLVSBS 
SS QVLWAI SS AI SVAF FAliSGlAAQM^lALGLAGDYlAQGLKliS 
PG0V(71YLLWGAGALVVYWLLSLLLGLVLALLGRI LWGT.KLVI F 
LAGFVALMRSVPDPSTRALTiTiTiftTJiILYAIiLSRIiTGSRASGAQL 
EAK\mGLERQVEELRMRQRRAAKGARSVEEE 


(S99S 


144 


1346 


GS VAVGlrSG IMAAQKDLWDAIVIGAGIQGCP^AYiniAKHRKRIIi 
I^EQPFTjPHSRGSSHCK^SRIIRKAYLEDFYTRMMHECYQIWAQI* 
KltKAGTQLHRQTGLIJ.I^MKKNQELKTIQANLSRQRVBHQCrvSS 
EELKQRFPNI RLPRGKVGLLDNSGGVT YAYKALRALQBA1 RQI>G 
GIVPJX5BKWEINPGI»I>VTVKTTSRSYQAKSLVITAGPWTNQriL 

HIYGLPTYJBYPGLMKVSYHHGNHADPEEPJ)CPTARTDlGDVQIIi 
SS FVRDHLPDLKPEPAVI ESCMYTNTPDBQP IIjDRHPKYDNIVI 
GAGFSGHGFKLAPVVGKILYBLSMKLTPSYDLAPFRISRFPSLG 
KAHL 


699<J 


■ 543 


1942 


BTANAEAAARKSAMDWKim^RRRIiATPmXrPNKXKSEQELKDEE 

MDIiFTKYYSEWKGGRKNTNEFYKTI PRFYYRiPAENEVLLQKLR 

EESRAVFIORKSREIiLDNEEXG^WFT^LDKHQTPPM IGEEAM IN 

YFJJPIjn/GBKAGAKCiCQFFTAKVFAKl^HTO 

VMRKVWIflQTRlGLSLYBVAGQGYIiP^SDLEify I LELI PTLPQL 

DGLEKSFYSFYVCXAVRXFFFFLDPLRTGKIKIQDILACSFLDD 

LLEIiRDP^r^KfiSQKTNMFSAPSALRVYjGQxIJ^ 

EEIiSRYGTATMTNVFIJXRVFQECXTYDGEMDYKTYLDF^ 

RKEPAAIXJYIFKLLDIE^KGYIJWSLlynrFFRAlQELMKIHGOD 

PVSFQDVKDBI FDMVKPKDPLKISLQDLINSNQGDTVTTI LIDL 

NGFWTYENREALVANDSKNSADLDDT 


6997 


370 


1104 


AMBLTI FILRLA1 YIIiTFPLYIiIiiFIiGLWSW I CKKWFFYFLVRF 
TVI YNEQMA3 KKRELFSNLQE FAGP SGKLSLLBVGCXJTGANFKF 
YPP^CRVTCIDPHPNFBKFLlKSIABllRHI^FFaFVVAAJGE^ 
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Amino acid segment containing signal peptide 
(A»Alanine, OCysteine, D~Aspartic Acid, E= 
Glutamic Acid, P= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L^Leucine, M=Methionine f N^Asparagine, 
p=Proline, Q=Glutamine, R^Arginine, 
S-Serine, T=Threonine, Valine, 
W=*Tryptophan, Y=Tyrosine, X-Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 








QVAIX3SVDVWCTLVLCSVKNQERILREVCRVLRPGGAFYFMEH 

VAABCSTWNYFWQQVLDPAWHLIiFDGCNLTRESWKAIiE^ 

LKLQHIQAPLSWEbVRPHIYGYAVK 


6998 


2 


616 


FVSRALLRVRSRRHPAEBRAAPGRPEDAPIECPGATNCPEPLWC 
SHLPVPYAPPTMESRGKSASSPKPDTKVPQVTimKVPPAADGK 
APLTKPSKKEAPAEKQQPPAAPTTA^ 

LKPAPTVPSSPnATPBPKGPGDGAERDEAASGGPGGRGPWSCEN 
FNPIiI>VAGGVAVAAIAI*ITiGVAFriVRKK 


6999 


14 




nPTXOJXfcrnPnTAM^ TRIES SDVXRLIMO YLKENSLHRAIiATLQE 

ETTVSLNTVDSIKSPVADINSGHWimnjQAIQSUa^DKTL 

YEQVVLELIELREIjGAARSIiIjRX^I^ 

LARSYFDPREAYPDGSSKEWlRAAIAjQAl^AGBVSVVPPSRIiMA^ 
DGQAI»KWQQHQGLLPPGMT IDLPRGKAAVKDVBEEKFPTQL3RH 
IKFGQKSHVBCARFSPDGQYIiVTGSVDGFTEVWN^ 
KY<^fcNFMMMDDAVI»CMCFSRmBM^ 

TLKB FRGHSS FVN RATFTQDGH Y I ISASSDGTVKI WNMKTTECS 
NTFKSIXSSTAGTDITVNSVILI^KNPBH^^ 
GQIVRSFS^KREGGDFVCCALSPP^EWIYCVGEDFVLYCFSTV 
TGKLEP-TLTVHE KDVIG IAHHPHQNLIATYSEDGLLKL WKP 


7000 


2 


827 


GPGVVFLBIjMESEGPPESERSEFPSQREEENEEEEAQEPEKTGP 
KNPLX&PAIiTGDVEGLQKI FETJPENPHHEQAMQLIiIiBED IVGRN 

llyjuvcmagqsdviraiakygvni^ 

etlkalveldvdiealnfreerardyaarysotecvkfi.dwada 

rltljocyiakvsijwtdtekgsgki^ 

ETHTEAS INELFEQRQQ1ED I VTPI FTKMTTPCQVKSAKS VTSH 
DQKRSQDDTSN 


7001 


2056 


B44 


RRCJjIIAFLKGCFIFIYPIFIFETEFIiSCCPGWSAVAQSRLIAN 

FASQVQAlPILPKDSQVGPDVKSEAAPKRAIiYBSVFGSGEI CGP 

TSPKPvI^IRPSEPVDAVVWSVKHDPLPIJ^ 

VSPAIVSPTQDSRPNMSRP1*ITRS PAS PLNNQGI PTPAQLTKSN 

APVHIDVGGHMYTSSIATLTlCYPESRIGRliPDGTEP XVLDSLKQ 

HYFIDRDGQMFRYIIOTLRTSKLLIPDDFKDYTIJUYEEAKYFQL 

QPMI^E^RWKQDRETGRFSRi'CECLVWVAPDLGEP^TLSGDK 

SLIEBWPEIGDVMCNSVNAGWNHDSTHVIJOT 

VLERLQORGPEIVGSCGGGVDSSQFSEYVIiRRELRRTPRVPSVI 

RIKQEPLD 


7002 


1043 


498 


PMPSSTRWTTS *TYTDTSSAWACRPTTGTCT* TAAPGPTVRWWP 
TPX3RHQSRRPO»TCWCSTSRP0GB*GGI^^TAPTRPTTSASSS 
SWTSAGTSWPAGRRTGTATSGTATTTSVWPGCGTRMWSTQWSSV 
PP^RSCC^P^ATTPPSKPGAPHAPCASSRHLAHGIiAPSSPGLPA 
RGAEVC 


7003 


818 


61 


QGRFPJVFCWQFJJFLQPPGTmiiSALIiAIiASKVTLPP 
GSVADKRKNP P W IRRRPVVVE PI S DEDWYLFCGDTVE ILEGKD A 
GKQGKVVQVIRQRNWVWGGLNTH YRYIGKTMDYRGTMI PSEAP 
IJjHRQVKLVDPMDRKPTBIBWRFTEAGERVRVSTRSGRI IPKPE 
FPRA1X1 1 VPK*rWI DGPKDTS VEDAL»ERTYVP CI^KTLQEEVMEAM 
G I KETR\NTPJRS IG I E PGABQI»LPNFC PS LEG 


7004 


121 


2285 


FIJjPVLTSRSLROPAVPtiAKLGGVEPAAMKaARAKTPRKPTVKK 
G\ PKRTliKTQIiG/ YYCRVRPIjGFP DQECC IEVINNTTVQLHTP E 
GYRLNRNGDYKETQ YS FKQV FGTHTTQKK LFDWAN P LVNDL IH 
GKNGLLFTYGVTGSGKTHTMTGS PGEGGKLPRCLDM I PNS IGSF 
QAKRYVFKSlTORNSMDIQCEVnM^ 

VDPEPADMITVQBFOCABBVDBDSVYGVFV^YIBIYIWYIYDLL 
EEVPFDPrNPNLHNLNCFVKlKHHNMYVAGC^^ 
VFWRGQKKRRIANTHLNRBSSRSIiSVFNI KLVQAPLDADGDNVL 
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Amino acid segment containing signal peptide 
(A=Alanine, Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine , G=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M-Methionine, N=Asparagine , 
P=Proline, Q^Glutamine , R=^Arginine, 
S=Serine, T=Threonine, V= Valine, 
^Tryptophan, Y-Tyrosine, X«unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\°possible nucleotide insertion) 








QEKEQIT I SQLSLVDIAGSERTNRTRAEGNRIiRBACTJ INQSIJ4T 
LRTCMDVIjRENOWGTNKMVPYRD^ 
VCVNPKAEDYEKrn^QVMRPABVTQI^EVA]^ 
YRNQPRGP\lGNEPLVTD\A^SPPPLPSCEILDINDEQTbPRL 

GKIjNBKBKMlSGQKLBIBRLEKKNlcri^^ 
KRNIiQQELETQNQKIX)RQFSDKRRLKAIUjQGMVTETTMKW^ 
ERRVAAKQLEM^0KIjWVia5EKLKX?I»KAIVTEPKTKKPBRPSRBR 
DRBKVTQRSVSPSPVPVSYL 


7005 


63 


876 


RNMALYQRWRCLRLQGLQACRIJITAWSTPPRWLAKRLGLFEKL 
WAAQVKPLASMAQKJBPRTIKISI>PGGQKIDAV7^WNTTPYQIJ\RQ 

FWHSSTHVI/3AAABQFIiGAVLCRGPSTBYGF 

G3ELPVIiERICQBLTAAARPFRRLEASRDQI^QLFia)NPFKim 

IEEKVTGPTATVYGOGTL\HM»CQGPHIjRH^ 

LWRSSG 


7006 


22 


898 


NAFGPJ&STAVKMAAAAWLQ^ 

VAAADRSKWH IPI PSGKNYFS FGKILFRIITTIFLRFDGBPCDIjS 

3 KLPQKCS EILFKTQTFSGDFMHRIiPIJCjGBKQEAKENGTNLiTFIG 
DKTAMHEPLG\FWQDAPYIFIVHIGISSSKESSI^ 
VEVKGPYEYIiTLEDYPLMIFFMVMCIV^ 
DLLRIQFWIGAVIKLGMLEKAVFYAGFQ 


7007 


2 


1001 


AMWStiPGTPKPRPATPGASSVEQIiRJCEGl^LFKCGDYGGAIAA 
YTQAUJLDATPQDQAVLHP^RAACHIJaiEDYDKAETEASIOVIEK 

I»RNiaGQIQEKVRYMS3TDAKVEQMFQII>LDPEEKGTEKKQKAS 
QNIiVVIiAREDAGAEKI FRSNGVQLIiQRLIJJ^ETDIJ^LAAIiRTL 
VGICSBHQSRTVATLSIIOTRRVVSIIiGVESC^VSIiAACmLQV 
MFDALKEGVKKGFRGKEGAI IVGEWiCQVPWIxLDVTVHEGMGLSQ 


7008 


70 


1478 


OISAIGHERPPPAHLPAGGRRI^CPRSCRWLGRPPSGLPPGPR 

SPPPI^PGQKMVQKKPAELQGFHRSFKGO^PFEIiAPSIiDQPDH 

GDSDFGLQCSARPDMPASQPIDI PDAKKRGKKKKRGRATDS FSG 

RFEDVYQLQBDVIjQEGAHARVQTC INL I TSQE YAVKI I EKQPGH 

IRSRVFREVFJ1LYQCQGHRNVLELIEFFBEEDRFYI*VFEKMRGG 

S I I^H IHKRRHFNEIiKAS VVVQDVASAIiDFLHNKG IAHRDLKPE 

NILCEHPNQVSPVKICDFDLG^IKlJ*3DCSPISTPEI.bTPCGS 

AEYMAPEVVEAFSEEASIYDKRCDLWSLGVILYTIiLSGYPPFVG 

RCGS DCGWDRGRACPACQNMLFES I QEGK YE FPDKDWAHI SCAA 

KDMSKIJ>VRDAKQRLSAAQVLQHPW^ 

WDSHFLLPPHPCRIHWPGGLWTVTVNK 


• 7009 


1 


626 


ARQLRNSWVDDFVAAPI*IPIiSGQI PTGNSLYESYYKQVDPAYTG 
RVGASEAALFLKKSGLS DI ILGKI WDLADPEGKGFI»DKQGFYVA 
LRLVACAQSGHEVTLSNLNLSMP PPKFHDTSSPLMVTPPSAEAH 
WAVRVEEKAKFDGI FESLLPlNGl^GDKVKPVIjMNSKI,PLDVI> 
GRVWDLSDIDKDGHLVRDEFAVAMHLVYRALE 


7010 


79 


571 


SHTRRAWPETLLS PLC^U^GGTAMSGGEQKPERYYVGVDVGT 
GSVRAAIiVI)G^GVLIiAF7\IXiPIKNWEPQFKHHEQSSE 
VVTKKWG^IDI^QI RGLGFDATCSLVVLDKOFHPLPV^ 
HRNVIMWLDHRAVSQVNRrNETKHSVLQYVGG 


7011 


3 


994 


RIQTLPNQNQSQTQPIJ^PPAVlK)PIAPQTTFGVG/rQPQPO^L 
LQAQISAAS ITPLLQTQPQPLLQQPO^KAGLLQPPVRIVSQPQP 
ARRLDPPSRFSGKNDRGDQVPWRia>DRSREREHBRRllSRERSPQ 
RKRSRBRSPRRBRBRSPRRVRRVVPRYTVQFSKPSIJDCPSCDtM 
ELRRRYC^YIPSDFFDAQFTWVI1AFPI^RPF0IX^C3*FYV^ 
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Amino acid segment containing signal peptide 

l ZV"Al aril TIP* r-Pvnhpiiif* V\— AQnavhi r*» 2\r»"i t? — 
\n^rucuuuCf k»-uyoLcXUC| U^ASpaLblC ACIm/ J5— 

Glutamic Acid, F«Phenylalanine, G=Glycine„ 
H=Histidine, I=Isoleucine. K=I,ysine, 
L-Leucine, M=Methionine, N^Asparagiue , 
P= Proline, Q=Glut amine, R^Arginine, 
S^Serine, T=Threonine, V=Valine, 
W^Tryptophan , Y=Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








RETVESLEKNMA1LDPPDADHLYS 

ABDPQBUtDGFQHPAmiVKFliVGMKGKDEAMAIGGHWSPSLDGP 
DPEKDPSVLIICr\AIRCCKAI>TG 


7012 


1 


2661 


RRAGSVKRGKAJILFGPTERQSERPI^PSAAI^PSMLSGKKAAAA 

AAAAAAAATGTEAGPGTAGGSENGSBVAAQPAGI^GPAEVGPGA 

VGRRTPRKKEPPRAS PPGOLAB PPGSAGPQAGPTWPG5 ATPME 

TGIABTPBG\RRTSRRKRAKVEYREMDBSLANI*SEDEYYSEEKR 

KAKAEKEKKLPPPPPQAPPEEKNESBPBBPSGVBGAAFQSRLPH 

DRMTSQEAACFPDI ISGPQQTQKVFLFIRNRTLQLWLDNPKIQL 

TFBATIiQQI.RAPWSDTVLVHRVHSYLEPJIGLINFGIYXRIKPL 

PTKKTGKVI I IGSGVSGLAAARQl^FGMDVTLLEARDR VGGRV 

ATFRKGNWADLGAMVVTGIjGGN^ kqkcp 

LYERNGQAVPKEKDBMVBQBFKRlJ*BATSYLSHQIiDF 

VSLGQALBWIQI^RKHVKDEQIBHWKKlVjm?BELKEIiLNKMV 

NI*KEKIKBLHQQY1<EASEVKPPRDITAEFLVKSKHRD 

YDKIAETQGKLEEKLQEIjBANPPSDVYLSSRDRQIIJDWHFANLK 

FANATPLSTLS LKHWDQDDDFB FTGSHLTVRNGYSCVPVAIiAEG 

IiDIKLNTAVRQVRYTASGCEVIAVNTRSTSQTFrYKCnAVLCTL 

PJ^VI^QQPPAVQFVPPLPEWKTaAVQP^FGNI^VVLCFDRV 

PWT)PSVNLrarVG£TTASRGELFLFWNLYKAPIT.LAl,VAGKAAG 

IMENIS^DVIVGRCIiAIUCGIPGSSAVPQPKETVVSRWRADpWA 

RGSYSYVAAG3SGM)YDliMAQPITPGPSIPGAPQPIPRIiFFAGE 

HT I RNYPATVHGALLSGLREAGR IAIX3FLGAMYTLPRQATPGV P 

AQQSPSM 


7013 


1 


2661 
- 


RRAG3VKRGEARIjFGPTKRQSERPIJIPSAARRPE>1I>SGKXAAAA 
AAA AAAAATGT EAG P GTAGGSENG S EVAAQ PAG L SG P AE VG PGA 
VGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTWPGSATPME 
TGlAETPEG\RRTSRRIOlAKVEYREMDESljANIi3EOEYYSEEER 
l!»KAEKBKKLPPPPPOAPPEEBNBSEPEEPSGVEGAAFQSRIiPH 
DRMTSQBAACFPDIISGPCKWiKVFIiFIRBmTI^IiWLDNPKlQI, 
TFEATIxC^LEAPYNSDTVLVHRVHSYLBRHGLINFGIYKRIKPL 
PTKKTGKVI 1 1 GSGVSGLAAARQLQS FGMDVTLLEARDR VGGRV 
ATFRKGNWADIjGAMVVTGLGG^JPMAVVS KQVNMELAKI KQKCP 

T Vl?7vitrr*r^TVxrnirT7i/T~»t?Mt ro/\t3t7vnri* y DnTCvr f*iirvT TMnttrruvTun 
1j X ISAWtsyA V V KJiKJJKM VEyK I^HULKATli X l*imQIjDFNVIJ*NKP 

VSI/^ALEWIQI^EKHVKDEQIEHWKKIV^ 

NLKEKI KELH0X)YKEASEVKPPRDITAEFLVK3KHRDLTALCKE 

YDELAETG^KI^EKLQEIjEANPPSDVYI^SRDRQILDWHFA 

FANATPLSTI^LKIIWDQDDDFEFTGSEirVRNGYSCVPVAIiAEG 

LDI KIjnTVVRQVRYTASGCK VI AVNTRSTSQTF I YKCDAVLCTL 

PLGVLK0X3PPAVQFVPPI,PEWKTSAVQRMGFGiniNKVVIX:Ft)RV 

FWDPSVNI>FGHVGS ITCASRGEI^FTiFHNIjYKAP ILIJUjVAGEAAG 

IMBNX SDDVIVGRCLAILKGI FGSSAVPQPKBTWSRWRADPWA 

RGS YS YVAAGSS GNDYDLMAQPITPGPS I PGAPQP I PRLFFAGE 

imRNYPATVHGALLSGLRKAGRIADQ 

AQQSPSM 


7014 


3 


3350 


DFFAVGDECmiLATLKlX5WLEGSI^GRTGlFPyKFVKl,CPin'RVE 
ETMALPQEGSIiARlPETSLDCL^ITljGVEEQRHETSDHEAEBPD 
CI ISEAPTS PI^HTjTSE YDTDRNSYQDEDTAGG PPRS PGVEWEM 
PIATDSPTSDPTEVVNGISSQPWPraPl^ICSQYYSTVGGSHP 
HSEQY PDLIiPLEARTPJ^YASLPPKRF^SOiiKTLQKPVLPLYRGS 
SVSASRWKPRQSS PQLHNLAS YTKKHHTSSVYS ISERIiEMKPG 
PQAQGLVMEAATHSOGDGSTDIiDS BdiTQQIj I EFEKSLAGPGTE P 
DKILRHFSIMDFNSEKDIVRGSSKI.ITKQKLPERRKAT1RPPPPR 
PCTPVSTS PHLLVDQNIJO?APPLVVRPSRPAPLPPSAQQRTNAV 
SPKIiSRHRPTCETIiEKEGPGIIMGRSIJXyrSPCPLVLVRlKKME 
RDI^MYSRAOBEI^MI^KQDBSSPAP^EDLKFCF^NIF^LN ' 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, OCyeteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 

Leucine , M=Methionine, N-Asparagine , 
P» Proline, Q=Glu t amine , R=Arginine, 
S=Serine, T=Threonine, v=Valine, 
W=Tryptophan, Y -Tyrosine, X=Unknown, *~Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








MELQQIiRffllTIiLSSQSSSLVAPSGSVSAENPEQRMIiEKRAKVIE 

ELLQTERDYIRDI^CIERIMVPMQQAQVPNIDFEGLFGNMQMV 

IKVSKQIiLAALEISDAVGPVFIXSHRDELBGTYKIYCQNHDKAIA 

IiLEIYEKDEKIQKHbQDSLADLKSLYireWGCTNYINLGSFTjIKP 

VQRVMRYPLI^BLLNSTPKSHPDK^LTNAVIAVKB INVNINE 

YKRRKDLVLKYRRGDEDSLME KI S KLN IHS I IKKSNRVS SHLKH 

LTGFAPQIKDEVFBETE KNFRMQERLIKSPIRDL3LYLQHIRES 

ACVKVVAAVSMWDVCMERGHRDIiEQFERVHR 

TERLVISPl^QLI^Mrn>PHKLVQKRFDKIJjDFYNCTERABKLK 

DKKTLEBLQSARNNY^ALMRC^DKLPKFHQYAQ^ 

AEAHC2)FVHQALEQI^PIiIjSIjI>KVAGUBGN1j iaifh eehsrvlq 

QliQVFTFFPESLPATKKPPERKTIDRQSARKPLLGLPSYMLQSE 

ELRASIXARYPPBKLFQAERNPKAAQDI^VSIiLEGDLVGVI KKK 

DPMGSQNRWL I DNGVTKG FVYSS FLKP YNPRRSHSDAS VGSHSS 

1ESBHGSSS PRFPRQNSGSTUTFNPN\S\MAVS FTSGSCQKQPQ 

K^PPPKEWDOGTLSASLNPSNSESSPSRCPSDPDSTSQPRSGD 

SADVARDVKQPllATPRSYPJ^FRHPEIVGYSArPGRNGQSQDIjVKG 

CARTAQAPEDRSTEPDGSEAEGNQVYFAVYTFKARNPNELSVSA 

unirr trrr RRimvTfilsrrKiWIiAKVNGiGCGYVPSNYIRKTEYT 


7015 


1B42 


513 


RQAVf^B\VAAPSWRGARI*VQSVIjRWQVGPHVARERVI P FSSIiL 

GFOJU^CVSCVTVGSAPSGPRIASASRSNGQGSALDHFLGFSQPDS 

SVTPCVPAVSMNRDEQDVIiLVIIHPDMPBNSRVLRVVIiLGAP 

KSTLSHQIiLGRKVTPVSRinraTTRCQA^ 

GI IS PGKQKRHHIiELSLLEDPWKSMESADLVVVLVDVSDKWTRN 

QI>SPQIiLRCLTKYSQI PS VLVMNKVDdiKQKSVT,LEjQ'XAALTEX; 

VVNGKKIiKMRQAFHSHPGTHCPSPAVKDPNTQSVGNPQRIGWPH 

FKEI PMLSALSQBDVKlXKQYliLTOAQPGPWE YHSAVLTSQTPE 

EI CANI IRBKIiLEHIiPOBVPYNVQQKTAVWEEG PGGELVIQQKL 

LVPKESYVKIiLIGPKGHVISQIAQKAGHOI^lDIFLCDVDIRLSV 

KLLK 


7016 


167 


2513 


ILNAPKPPPPRDSVEAVAAKRDTGGGSWGTC 
AFRQKIiVSQIBDAMRKAGVAH^KSSKDMESHVFLKAKTRDEYLS 
LVARLI IHFRDIHNKKSOASVSDPMNALQSLTGGPAAGAftGIGM 
PPRGPGQSLGGMG SliGAMGQPMS LSGQ P PPGTSGMAPHSMAW5 
TATPC/rQLOrWVAAAAAAATARSSSSSSRRRYSSSSSSSNSKQ 
FQAQQSim)Q\QFQA\vVQQ<^l*\QQ^ 

PMQQPQPPPSOAL^OIiQ^HHTOifflQPPPQPOXJPPVAQNQPSQ 
LPPO^O^PLVSC^ALPGQMI»YTQPPriKFVKAPMVVQQPPVQP 
QVQQQOTAVO/nVOJU^QMWuTGVQVSO^ 

QSMPPPPQPSPQPGQPSSQPUSNVSSGPAPSPSSFLPSPSPQPF 

\OSPWARTP0NFSWSPGPLNTPVNPSSVMSPAGSSQAEEQQY 

I^KLKGJ^KYIEPLRRMINKIDKNBDRKKD^ 

SK^CPLIO'I^KCBIAl£KLKNDMAVPTPPPPPVPPTKQQYLCQP 

LLDAVI^IRSPVFmSLYRTFVPAMTAIHGPPITAPVVCTRKR 

PJ^DBRQSIPSWiGEVARl^PKFLVfll^PSHCS^ 

KLDDKDIjPSVPPI^LSVPADYPAQSPIiWIDR(2WQYDANPFLQSV 

HRCMTSRI^LPDKHSVTAIJiWTWAQSTOOACl^AA 


7017 


1 


1785 


im^^CYMNSVI*ALFMATDFRRQVX*SI^LNGCNSLMKK^ 
PAEIAHTQRBAYAPRIFFmSRPPOTTPRSQODCSEYLRFLLDR 
LHEEEKILKVOASHKPSEILECS ETSIiQEVASKAAVLTETPRTS 
DGEKTLIBKMFGGKLRTHXRCLNCRS^ 

YSLEYMSCPDCSQSPSIQDGGLMQASV^PSEEPWYWPTIAAP 
ICIJSLVNEKTIGSPPNEFYCSENTSVPNESNKILVNKDVPQKPG 
GKTTPSVTDIiLNYFLAPE IKTGDNQYYCKNCASLQNARKTMQIT 
EE PEYL I LTLLR FS YDQ KYHVRRKI LDNVS LPL VLEL PVKRITS 
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SEQ 
ID 
NO; 


Predicted 
beginning 
nucleotide 
location 
cone spon ci ing 
to first 
amino acid 
residue of 
amino acid 
secjuen ce 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, O Cysteine, D=Aspartic Acid, E* 
Glutamic Acid, P=Phenylalanine, G=Glycine, • 
H-Histidine, I=Isoleucine, K>= Lysine , 

T, — TiPMIfl T°l F* M — Mpt'hl'nn'tno W-l cna v^rr'i via 
ii-UCUUJUIC| ne L-IlJLvJlI-HlfcJ / K-iiSpaidy IDG , 

P^Proline. CNGlutamine, RoArginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X-Unknown, +=Stop 
Codon, /=possible nucleotide deletion, 
nuitieoLXuc insertion ) 








FSSI^BSWSVDVDFTDI^ENlJUaa»KPSGTPEASCTKLVPYl^S 

SVVVHSGlSSKSGHYYSYARNITSTDSSYQMYHQSEALAtASSQ 

SHLLGRDS PSAVFEQDLENKRMS KEW PLFNDSRVTFTS FQSVQK 

ITSRFPKOTAYVLLYKKQHSTNGkSGNN^ 

M«DAITKDNKI»YI<iEQELNAHAHALQAASA8CSF^ 

Ffcbt^FTW»UGGGlJrNTvGR 


70X8 


484 


1066 


SLVFRGNTW3GEAGHHCSALFNIAAYHQLFVGTERIRAPEIIFQ 
PSLTGEBQAG IAETLQYIIiDR Y PKDVQEMLVQN^FI»TGGNTMy P 
GMKARMBKHLLEMK P FRSS bWQLA5NPVLl)AWYGARDWALN3JL 
DDNEVWITRKE YEEKGGE YLKEHCASNI YVP IRLPKQASRSSDA 
QASSKGSAAGGGGAGEQA 


7019 


1048 


335 


APGGFLVTMVFPAPS P PWMLQCCSHBVTAGP PTLCKDMS AL.VAA 

RMRHIPLAPG3IMRDLPNIEVRI*SIXnT4ARKIiRYTHHDR 

SSGAI^GVCSCVEAGKACDPAARQFNaTjIPWCI^ 

GLYGRLEWIXSFFSTTVTl^EPMGKCK^Vr^PKQHRW^ 

SG^FPOTYRIjPGNILDKHRQVGNAVPPPLAKAIGIiE 

ARESASAKIKEEEAAKD 


7020 


1 


2154 


FADSKRKSVLLDK1 KNLQVAI^TSKQQSLET AMSFVARNTFKRVR 
NGPUlRKVAVFFSin'PTRASPQLREAVLKLSnAGITPI^ 
DRQlilN^IHNmVGHALVLPAGRDLTDFLENVIiTCHVCLDIC 
NIDPSCGFGSWRPSFPwDRRAAGSDVDIDMAFILDSAETTTLB*QF 
NBMKKYIAYLVRQLCWS PDPKASQHFARVAWQHAPSES VDNAS 
MPPVKVEFSLTDYGSKEKLVDFLSRGMTQLQGTRALGSAIEYTI 
EtTVFESAPNPRDIiKI VVI^«TGEVPE0X3I>EEAQRVI LQAKCKGY 
FFVVI^rGRKVNIKBVYTFASBPNDVFFKLVDKSTELNEEPUW 
FGRLLPSFVS SENAFYLSPDIRKQO^FQGDQPTKNIjVKFGHKQ 
VIWPNNVTSS PT9NPVTTTKPVTTTKPVTTTTKPVTTTTKPVTI 
INQ PSVKPAAAKPAPAKP VAAKP VATKTATVRPP VAVKP ATAAK 
PVAAKPAAVR P P AAAAAKPVATKP EVPRPQAAKPAATKPATTKP 
MVXMSREVQVFBITENSAKLHWERPEPPGPYFYDLT^SAHIXJS 
LVLKQNLTVTDRVIGGLLAGQ'rYHVAWCYLRSQVRATYHGS FS 
TKKSQPPPPQPARSASSSTINIJWSTEPLALTETDICKLPKDEG 
1 l-KUt 1L»KW z r DPN ITCii LAKFW Y GGCGGNKNKFQSQKBCEKVuA 
PVLAKPGVISVMGT 


7021 


2 


338 


VNAVSFFPlTCYAFATGSDDATCRIiFDIiRADQEIil^ 
ITSYAFSKSGRIiIJLAGYDDFNCNVmmiK^ 
CLGVT DDGMAVATGS WDS FkRIWN 


7022 


2 


856 


VYIGS FWSHPLJj I PDNRKLFFJIEEQDLFRDI QS LPRNAAI*RKLN 

|Yf TIfDIlDT 7V Wl TU7L VTTCCT It V L'MDC^fPfiynMirYypr ttvttjt JVt*TV 
IJJLiXlVtUuujAlLVllflXXXooljAA 

GRIEREHQIS1>GDFPNLKRMQBQLQAQDFSKFQPLKSKLLEVVD 
DMLAHDIAQLMVLVRQEBSORJPIQMVKGGAFEGTL^ 
EGAQEGIDDAEWVVARDKPMrl7EIFYTr£^ 
VRSKLPNSVtiGKIWKLADIDKDGMl^DDEFALANHLIK^ 
ELPNE LPAHLLP PSKRKVAE 


7023 


2 


748 


AMVFGGWPYVPQYRD IRRTQNADGFSTYVCLVLLVANI T.RI LF 

WFGRRFESPLLWOSAIM I LTMLLMLKLCTBWVANELNARRRSF 

TAADSKDEEVKVAPRRSFIjDFDPHHFWQW 

VAGYITYI^IDSALPVETLGFIj^^ 

EGMSIKrmiMWTSGQAPlClTVYF^nKGAPM 

ILGQAYAFARHPQKPAPHAVHPTGTKAL 


7024 


1207 


190 


RTCVTGWAQVWMFGGGGVLSSGEQLQMPVKPERGLGPSDGWLV 

SSRRGSPGTVLGLPFWLLTPVLVSRSIRSMLLLTRSPTAWHRLS 

QUOPPVLPGIT^GO^IIJISWIJ^ROGPAETGGO^ 

RliLITGLFGAGLGGAHLALRAEKERIiQO^KRT^ 

FHLLDHRGRARCKADFRGQWVlMYre 

VRQLEAEPGLPPVQPVFITVDPERIJDVEAMARYV^DFHPRLLGL 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, B- 
Glutaraic Acid, Phenylalanine, G=Glycine, 
H=»Histidine, I=Isoleucine, K=Lysine, 
L=Leucine f M=Methionine, N«Asparagine , 
P*=Proline, Q=Glut amine, R-Arginine , 
S-Serine, T=Threonine, V=»Valine, 
W=Tryptophan, Y=Tyrosine, X-Unknown, *«Stop 
Codon, /-possible nucleotide deletion, 
\~possible nucleotide insertion) 








TGSTKQVAQASHSYRVYYNAGPKDBDQDYIVDHSIAIYLLNPDG 
IiFTDYYORSRSAEQI SDS VRRHMAAFRSVI>S 


7025 


232 


832 


ERNSP IGNNKNIi*K\HSLBCLCFRGDWEGNU > QPQTI>QDNQBJECF 
KQVT RTCE KRP TFNQHTVFNIJIQRIjNTGDKLMEFKKLGKAFI SG 
SDHTQHQLIHTSBKFCGDKECGNTFLPDSEVIQYQTVHTVKKTY 
ECKECGKS FSLRSSLTGBKRIHTGEKPFKCKDOGKAFRFHSQLS 
VHKRIHTGBK6YECKECGKAFSCG 


7026 


328 


1146 


NPNPSIGDIKDIKKAAKSMLDPAHKSHFHPVTPSLVPLCPIFDG 
LHQAM>S VGVS KRSIfTWGNENEERGTPYASRFKDM PNP IALE K 

HGBSADLLISCNAESAlGHXSSRPHVGBIMFTPIiFGDFBS PIJIK 
LRXSS*LPRKHR*QPINAVPJ^FIJ>QCMDGSIALRAIVSEIPVFR 
BKKNHG*KGlGBIF*VWGCn^PHxT^VrmVPKI^N^ 


7027 ^ 


43 


954 


GIUUiQQQQRPEDABDCAEGGGKRGKAGWBGGYPBIVK^ 
YYQELKTVTBGEWGQF^AIjKEPLPATIjR ITGYKSHAKE I LHCL 
KNKYEKEIi£DLEM)GQKVEVPQPLSWYPERIAW^ 
SPHLEKFHQFI*VSETKSGNISRQEAVSMIPPLLLNVRPHHKILD 
MCAAPGSKTTQLIEMLHADMNVPFPFXJFVI ANDVDNKR CYIiLVH 
QAKRIiS S PCIMWNHDASS I PRLQ I DVDGRKE I LFYDRI LCDVP 
CSGDGThti^lDVWKKWrTIlJSI^^ 


7028 


189 


608 


SRPPPEPEPGTMVEKGSDSSSBKGGVPGTPSTQSlJGSkNKI RNS 
KKMQSWYSMLSPTYKQRNEDFRKIiFSKbPEAERLIVDYSCALQR 
BILIiQGPJjYLSENWICFYSNIFRWETTISIQLKEVTCLKKKKTA 


7029 


1343 


40 


VLRSWTEAKQATCTSSKLRHGTGQEKGREGPRCPSGIAQIiRLWG 
/PCPHAGRETGPRASAPI PGS*GHGWHW*RKDGRGERSEGPSAIi 
SPHSPSLLNMQQAPTHVGPGI^GSORPRSSVVPEQVGVGSOLSRE 
RWRA* RSI^GAAASFJn , EMTKFJ^p/RPCXKm)SSNWFTQPGKK 

R*GKGGpRSPLTKA3GTTHI PTPFFGS I P /RPTRDSGPGTDNS \ 

aapgqkrghrea*qgpepv/wgrvtthlqgpag*tkplcjs\rnw 

VPGPAEGEQGEGAGLEGRP * PLKGCRSTLTFSPQLSI PMVGKKP 
PEGTTASFPP\RSC!HSE*RKPPPSCPHAPALSLPHPl J PLPIiPPL 
PLPI^AGT*HSARSGPJH3QSETGSLaiNCHHCPPHCPKCSPGG 
T 


7030 


2 | 


521 


FVCFSAPGSGQGGKRRVNMEIiSAVGERVFAAEAIJJCRRIRKGRM 
ra.VKWKGWSQKYSTWKPBENXIX^APXIJVAFBEREREMELYGPK 
IO?GPKPKTFIirj<A0AKAKAKTYEFRSDSARGIRI PYPGRSPQDL 
ASTSRAREGLRN \RVCPRQRAAPAPAAP\ PRRGP SGPGPRPC * G 
PGLHFPGPGGPSKiiGFVPASEQHQIIQQHLPRRGPSGPGPRPG 


7031 


960 


59 


HCSVPGAEWPRKPPAQICPQLTSRPHIiSSPRSLSPGOGHSPGPG 
/CKPS /RHCDEIiHEGPSRTAALPX3GKPQPKHGVEECG/PCPCIA 
PRRLTE P PALTVSP VGRAAPSGAL * PSGRACSACSHRLAPEAAL 
S AAAPRPS LGSGQNASGLPAASIipPQDS SQPHKTVP S PARS VP P 
LGAQARAAPPRLWCPRALVSG* EASPEAVSVAAGPPVPGPTPST 
SGSTASHSRRGC*SPR*TPAPPRRI^RSAAFEVLTAAASAQPC 
ASQGGPRPTGAGRTPSPLGIkPFSRGPPAASARPFCRHPSL 


7032 


1393 


2104 


RRPGRTEPVEPpPVPPPPRASNSKSRCR*RNtHLAPI**QSPLRK 
SRQIGTSSLPFGRSAGERPRPAATFCLSRGGSSPVF1.+PSSSSL 
EPWMKRQFGRLHSLFWKSWQKMNS PL7*TPKLDTSLMSGWRYRQR 
LPRLHTELKKSLQMASELAPPLPTPAPLASSLPPPPGPPPLLPV 
PIA*LSRSGILVPPNSGFSLSC\PLGDH*GSSGEVRGSCGSPPP 
HHCWVLPPPP*LLLPPR 


7033 


689 


815 


RSRDClLSSSAtSNRARRSKCSGPKRATPLDSGPGP*APPGPSSA 
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SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C= Cysteine, D=Aspartic Acid, 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=.Histidine, I-Isoleucine, KwLysine, 
L= Leucine , M=Me th i onine , N=Aspa r agine , 
paProline, O-Glut amine, R=»Arginine, 
S=Serine, T^Threonine, V=»Valine, 
W=*Tryptophan, Y-Tyrosine, X=Unknown, *^Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








UmPSSCS^TGALGPSPAGSRALGRCTSSVGPGSRWLTRTSSP 
GCATRTWRTMRMBPRPLRSRMGKSAPGIPAELPSAAPSGPSAPS 
AAAPSAPTTPAAAGPMTL*SRRTAEWCWPPSCSCGWGWC*SWSA 
WDWRl^PIiQVSPAPSSSCRASCCWCI^IT+SSSTARSRATGAS 
SSSTCPTSRSDRGAAWTPXSPMGAPLIiPCSVPIjISREEALODPR 
NPSP*GVCSGSSGHAG1ALGKPPVACSVP 


7034 


92 


1942 


EDTSSMPFRLIftI PMSIiLCAXJjPQHHGAPGPIXJSAPDPAHYRERV 
KAMFYHAYDSyi>BNAFPFDKl^PLTCXGHDTWGSFSLTLIDArJ3 
TLL\TLPYFQILGNVSEFQRVVEVLQDSVDPDIDVNASVPETNI 

FQTPTGIWYGTVNIiHGVm»GBTPVTC1^ 
TGDPVFEDVARVAU4RLW BSRSU I GI*VGNTIIDVLTGKWVAQDAG 
IGAGVDS YFE YLVKG&ILLQDKKLMAM FLBYWKATRNYTRFDDW 
YLWVQMYKGTVSMPVFQSLEAYW PGLQSLIGDIDNAMRTFLNYY 
TVWKQFGGIiPEPYT«PC^YTVKKREGYPIJlPELIESAMYLYRAT 
GDPTIJjEIiGRDAVES IEKI SKVECCFATIKDLRDHKXJDNRMES P 
FLAETVKYLYrXFDPTNFIlO^STPDAVITPYGECILGAGGYT 
FNTEAHPIDPAAiaCCQRLKEEQWEVEBLMRBFYSLKRSRSKFQ 
KNTVSSGI>WEPPARPGTLFSPENHI)Q^ERKPAKQKVPl»IiSCPS 
QpFTSKLAIiIiGQVFriDSS* PIJ^FFIFIFLRLNYNKLLIiAIIKK 
K 


7035 


92 


1942 


EDTSSMPFRLLI PLGLLCAIiIiPQHHGAPGPDGSAPDPAHYRERV 
KAMFYHAYDSYliBNAFP FDELRPLTCDGHDTWGSFSLTLIDAI*D 
TLL\TLFYFX)IU^SEFQRVVEVLQDSVDFDIDVNASVFETWI 
KVVUtjlji^AnjjIjSKKifcVJSVKAU^ 

FQTPTGMPYGTVNI>LHGVNPGETPVTCTAGIGTFI VEFATLS SL 
TGDPVFBDVARVAIjMRIiWESRSDIGI»VGNHIDVLTGKWVAQDAG 
t nigrum a vt?pvt \r win tt .t j\t\wt .mrmut .KVMTf a t dwvtr pnnuL 
YIiWVQMYKGTVSMPVFQSLEAYWPGl^SlilGDlDMAMRTF 
TVW KOFGGliPEFYNI PQG YTVE KREG Y PI»RPELIES AMYLYRAT 
GDPTTiIiELGRDAVES IEKISKTI/ECGPATIKDLRDHKlJJNRMESF 
FrJu^TWCYLYI*LFDPTKn?IHNlTC 

FNTEAH P I DPAALHCCQRLKBEQWEVBDLMRBFY SLKBSRS KFQ 

KNTVSSGPWBPPARPGTLFSPFJ^HBQARERKPAEQKVPIiSCPS 

QPFTSKIJUJ^VFLDSS*PLDNFTIFIFLRLNYNK^^ 

K 


7036 


442 


761 1 


CLAPLFSCt\iIINLHLAPSGRIJiWAWIiRGPGRN*LPGKGPSIPT 
RNW* ERKAGCSQPC/ PAQQHHGRPPGVS PLPRDPHPTTLR PL.PP 
PPPPPPPPPRRPPRHRRPG 


7037 


442 


761 


OJ^LFSCFQIINLHLAPSGRLRWAWLRGPGRN*I>PGEGPSI PT 
RNW*BRKAGCSQPC/PAOJHHGRPPGVSPIiPRDPHPTTIjRPIJPP 
PPPPPPPPPRRPPRNRRPG 


7038 


155 


891 


GAGAASMISSGLRAADFPRWKRHISEQLRRRDRLQROAFEE 1 1 l 

QYNKLLEKSDLHSVliAQKIiQAEKHDVPNRHE I S PGHDGTWNDNQ 

LQEMAQLRI KHQBELTFXHKKRGFXAQ\RVIDLNNQMQRKDREM 

QMNEAKIAEC&Q/riSDLETBCIiDLRT^ 

LQI TFTALEGKLRKTTEF^QELVTR WMAEKAQEAN^ 

LQEAASPAAERACRS3KGT3TSRTG 


7039 


155 


891 


G AGAASDMSSGLRAADFPRW KRH I S EQLRRRDRbQKQ; A FEE I I L 
QYNKLLEKSDLHSVLAQKXQAEKHDVPNRHEISPGJffiGTWNDNQ 
LQEMAQLR I KHQE EliTELHKKRGE LAQ \ RV I DLNNQMQR KDREM 
QMNEAKlJuSCLQTISDLJSTECLDLRT^ 
I^ITFTALEGKLRKTTEENQELVTRWMABKAQEANRLK^ 
LQEAASPAAERACRS SKGTSTSRTG 


7040 


34 


789 


KXTP PRRPHRCSSGHGSDNSS VLSGEIiP PAMGKTALFYHSGGSS 
G YESVMRDSEATGSAS SAQDSTSENSSS VGGRCIRSIiKTPKKRSN 
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ID 
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Predicted 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D=Aspartic Acid/ K- 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=I>ysine, 
L=Leucine, M=Methionlne, N=Asparagine, 
P«Proline, Q=K5lutamine, R=Arginine, 
S=Serine, T^Threonine, V=Valine, 
W»Tryptophan, Y=Tyrosine, X=Uhknown, *=Stop 
Codon, /"-possible nucleotide deletion, 
\=possible nucleotide insertion) 








PGSQRRRLI PALSIiDTSS PVRKPPNSTGVRW VDGPLRSSPRGLG 
EPPB1KVYEIDDVERLQRRRGGASKEAMCFNAKLKILEHRQQRI 
AEVRAKYEWIWKELFATKQY]^1^PNKWLSEPDI^QVWEIJ)SLK 
YI^ALECVTERLBSRVNPCKAHLMMITCFDrT 


7041 


l 


5*7 


SGRVAMGRRRAPAGGSLGRALMRHQTQRSRSHRHTDSWIJiTSEIj 
NDGYD WGRLNLQS VTEQSSLDDFIATAEIjAGTE FVAEKLNI KFV 
PAEARTGIjItSFEBSQRIKKLHF^NKQFUriPRRPI^^ 

lkoj\ekdnflewrrql\vrleeeqkliltpfkrnlofwrqlwrv 
iersdiwqivda 


7042 


7 


345 


PIHMAA7UVIJMl\iSPl^PHIO^YLI^ASHG\ATSI>HTKGAIi 

PLETVTHYTVlPKSKYVLVKPI^^ 

DDIJjMABVAI SDYGDKLTLBIiREKY 


7043 


2 


2170 


ARGMAARDSDSEEDLVBYG1X3LKPUBEGKRPKKPIPLQIXJTVRD 
BKGRYKRPHGAFSGGFSAGYFNTVGSKEGWrPSTPVSSRQNRAD 
KSVLGPBDFI^EEDIjSEFGIAPKAIVTTDDFASKTKDRIREKAR 
QLAAATAP I PGATLIjDDIJTPAKJ>SVGPELIJ?KMGRKEGQGVGP 

rvkrrprrqkpdpgvkiygcalppgssbgsbgedddylpdnvtf 
apkdvtpvdftpkdnvhgiaykgi^phqalpgtsgmifnlpsgg 
seragdi/?biglnkgrku3isgqafgvgalebedddiyatkti,s 

KYDTVLKDEEPGDGLYG>n7u?RQYKNQKESKKDt 1 RYVGKI LDGF 
SIASKPI^SKKIYPPPEIiPRDYRPVnYFRPMVAATSENSHLLQV 
I»S ES AG KATPDPGTHSIQIQLNAS KRAKIjLGETP I QGSATS VLB F 
LSOKDKERIKEMKQATDLKAAQI»KARSLAQNAQSSRAQPSPAAA 
AGHCS WNMALGGGTATLKASNFKPF AKDPE KQKRYDEFLVHMKQ 
GQKDALERCLDPSMTKWERGRERDEFARAALLYASSHSTLSSRF 
THAXERT>DSDQVEVPRDQENDVGDKQ^ 

IIPDKLLFQ /PJbVGLPRVKRDKYS VFNFLTliPETASLPTTQASSE 
KVSQHRGPDKSRKPSRWDTSKHBKKEDS ISEFLRIiARSKAEPPK 
QQSSPLVNKEEEHAPELSAN 


7044 


276 


734 


EVYLTDEFAKGRKVADLYELVQ YAGNI IPRIiYLLITVGVVYVKS 
FPQSRKDI LI^LVEMCRGVQHPl^GIiFliRimiQCTRNlLPDEG 
EPTDEETTGDISDS^FVIiLNFAEMNKLWVRMQHQGHSRDREKR 
ERERQELRILVGTNLVRLSQV 


7045 


3 


513 


I^FKMEALSRAGQEMSIiAALKQHDPYITSI^ 

KANQWEKTDI EGTIiFVYRR6ASPYHGFTIVimJ*MHNIjVBPVNK 

DLEFQLHEPFIAYRNASLSIYSIWFYDKNI)(^^ 

ETRR5QQA/RSGQXESQPGQWLQRPQAHRHPGDAEQSQG 


7046 


3 


513 


IjGFKMEAI^RAGQEMSLAALKQHDPYITSXAI)LTGQVAIiYTFCP 
KANQWEKTDI EGT1»FVYRRSASPYHGOT 

DLB PQLHE P FLLYRNASI.S I YS I WFYDKNDCHR I AKLMAD WEE 
ETRR5QQA/RSGQTESQIK3QWLQRPQAHRHPGDAEQSQG 


7047 


103 


486 


QMKIEKCEWSBGLTSIKGNCHNFWAISKI^rrYKE^KNLLNS^ 
IMIiIDVRBIWBIIiEYQIQPESINVPLDBVGEALQMNrPRDFKEKy 
NEVKPSKSDS/lVTCTYLAGVR^KK 


704B 


92 


627 


FFCLTLLSS WD YRHHATRRVI SS P VFTMKDSGKTFSSEB EBANY 
WKDIAMTYKQRAENTQBEIiRE FQEGSREYEAELETQLQQ IEYRN 
RDIiLS ENNRLRMELET I KEKFEVQHSEGYRQI SALEDDLAQTKA 
IKIXJI^KYIRELE<JANDDI»ERAKRATDHGLSKrFE\QRLH\QAI 
EKKW 


7049 


393 


938 


KRTGSAS YGGP P PGICGPATXAS VAGRCSS VGKI PARRCYEDEL 
VPVFEAVGRIYEI»RI>!^FDGKNRGYAFVMYCHKHEAKRAVREL 
NNYBIRPGRUK^CCSVDNCRLFIGGIPKMKKR^ 
EGVIjDVIVYASAADKMKNRGIiRIjRGVREP PRGCHWLGRKIil AWX 
ASSLWG 


7050 


393 


938 


KRTGSAS YGGPPPGLGGPATXASVAGRCSSVGKI PARRCYEDEL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spondxng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


/Amino acid segment containing signal peptide 
(A^ Alanine, C«Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=*Histidine» I-Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R-Arginine, 
S ^Serine, T^Threonine, V=Valine, 
^Tryptophan, Y-Tyrooine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








VPVFI^VGRIYBIiRLMMDFIXjKNRGYAFW 

NWYEIRPG^I/JVCCSVDNCRLFIGGIPKMKKREBILEHIAKVT 

BGVUJVIVYTaSAADKMKNRGLRIaRGVRBPPRGC^ 

ASSLWG 


7051 


119 


816 


KJCMNIiABICDNAKKGREYAIJL£5NYD^ 

VRDPAIKGKWQgVRQELLEEYEQVKSIVGTLBSFKIDKPPDFPV 
SCQDBPFROPAVWPPPVPABHRAPPQIRR/RQSRSKTSEERMGR 
SRSPGTCRPSTXPISKSEKPSTSRDKDYRARGRDDKGRKNMQDG 
ASDGEMPKFDGAG YDKDLVEAIiERDI VSRN PS IHWDD I ADLEEA 
KKLLRKAGVIjPMWM 


7052 


467 


715 


SCPGRGKMSKbLNPEEMTSRDYYFDSYAHFGlHEEMTiKDEVRTIi 
TYRNSMYHimn/FKDKVVLDVGSGTGII^MFAARQGPRR 


7053 


467 


715 


SCPGRGKMSKIiIihn?KBMTSRDYTFDSYAHFGIIJKEMI.KDBVRTL 
TYRNSMYHNKHVFTXDKVVIiDVGSGTGII^MPAARQGPRR 


7054 


1 


1036 


GTSQRSRETDARRRSAGAEPXARLPWPAALEKWPSCPCEPLGPG 
RRCRWDAMKYDEKIiARFRQAHI^PFNKQSGPRQHEQGPGEBVPD 
VTPKEAI^ELPPGEPEFRCPERVMDLGriSBDHFSRPVGLPLASD 
VQQLROAIEECKQVILELPEQSEKQKI>AVVRLlHIjRIiKLQELKD 
PNEDBPNIRVMiSHRFYKEKSKSVRQra 

CTG CYYRCHS KCLNL ISKPCVSSKVS HQAEYE UN ICP ETGLDSQ 
DYRCT^CRAPI /CS/IXTm>SEARQCDY7^YYCSHCHlWDLAV 
IPARVVHIWDFBPRICVSRCSMRYLAIjMV^PVLRLRBIN 


7055 


2 


527 


DSRRVSWRSWI^E/WGKHLCLFIWI^MNVLLFWKTFIiIiYNOGP 
EYHYlJtQMLG/AI^SimeASWNIWCSLII^ 
SQKVPSRRTRRLLDKSRTFHITCGATI CIFSGVHVAAHLVNALN 
FSVNYSEDFVELNAARYRtoEDPRKLLFTTVPGbTGVCMEVVrjFL* 
M 


7056 


2 


527 


DSRRVSWRSWLA^/WGKHW^FIWI>SMNVLLFWKTFI^YNO^ 

EYHYIflQMI^/ALCI,SllAS7lSVLNLNCSU 

SQKVP SRRTRRLLD KSRTFHI TCGATI CIFSGVHVAAHLVNAIiN 

FSVNYSEDFVELNAARYRl)EDPRKIJ^Fl*rVPGI*TGVCMRVVLFL 

M 


7057 


1368 


431 

r 


GIYIJIWEKIPRPTCIGDRQElTOl^lim^ENHRDQEIaLHASCQA 
SGEVPSOASLRGFFTEDEPGCFGEGBHIiPEALQW IQDEGTGEQL 
SPQERISEKQLGgiliPNPJISGKKS'lTviWLEEKRETSQKGQPRAPM 
AQKLPTCRECX5KTFYRNSQ1.I FHQRTHTGETY FQCT I CKKAPLR 
S SDFVIOIQRTO7X5EKPGKCD YCGKGFS DFSGLRHHE K IHTGEKP 
YKCPICEKSFIQRSNFNRIIQRVHTGEKPYKCSH0GK5FSWS3SL 
DKHQRSHI^KKPF0*PVTKLSFPISISQPSHKN1^LHQBE^CLR 
GYPC 


7058 


1 


469 


FSGFGAWDAIi^MSDI^ITEAFLYMDYLCFRALCCKGPPPAR 
PEYDLVC I GIiTGSGKTS I»LS KJasS ES PDNVVS TTG FS IKAVPFQ 
NAIUWKJBIiGGAmiRKYWSRYYQGSQGVIF^ 


7059 


1 


1178 


WPAFPRQPAAAAMDAI^TGPRRARGCLGAAGPTSSGRAARTPA 
APWARPSAWLECVCVVTFDLBLGQA1£^ I 

CYLSFPDSHS gclgdtqfsfrmrqcggqrs pwhaddrhynsrap 

VAWR^PAHYFGYVYF^QVKDSSVKRGYFQKSLVLVSRLPFVRL 

FOJVLt»SLIAPEYFDKIJ^CLEAVCSRIIXiWPA^ 

GWVQVRI PSRVDKSESSPPKQPDQENLLPAPWLASVHELDLF 

RCFRPVLTHMO^WEU«UjGEPIa,VIA^ 

QPIJIFCCDFRPYETIHDSBPIGJFTTRTQ^ K 
TIiQHWPHILRVGBPKMSGDIjPKQVKLKKPFKV* RPWDTKP 


7060 


90 


1670 


S VNL PPSkWP WEEAMDS TKSE PLKGi> PBAEDGMI EYKKLVNPSQ 
YRFBHLVTQMKWRI>QKGRGPAVYQIGVEDNGLLVGIAEEEMRAS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P«=Phenylalanine. G=Glycine, 
H^Histidlne, I=Isoleucine, K=I»ysine, 
L^Leucine, M=Methionine, N=Asparagine, 
P-Proline, Q=Glutamine , R=Arginine, 
SaSerine, Threonine , V=*Valine, 
W -'Tryptophan, Y=> Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\*=possible nucleotide insertion) 








LKTLHRMAKKVGAD I TVLRKREVDYDS DM PRKI TEVT»VRKVPDN 
QQFLDIiRVAVLG^DSGKST3bIjGVI,TQ^ 

LHBIQSGRTSSISFEILGFNSKGEVHGINGXXiWGQTI^MGW*** 
RT*DGGRWRLFBIV*MNALRGL*TSSAPLRKSMGNQLN*IKNG 
VKIKRQGHPGNGLG PGNS EG VGRAGRRH * GPWALGQWNYSDSR 
TAEBICBSSSKMITFIt)UW3HHKYI^TIl^TSYCPDCAriI*LV 
SANTGIAGTTREHI/SIiAI^KVPFFTVVSKIDIiCAKTT^ 
QLERVbKQ PGCHKVPMLVTSEDDAVTAAQQPAQS PNVTP I FTLS 
SVSGBSI^LLKVFIiNIIiPPIiTNSKBQBBLMQQLTEFQVDEIYTV 
PEVGTWGGTLSR* IDLLATLPTQPSPIYSKTSWPKGGDPGI 


7061 


364 


710 


koi^cDi'/ioDf»r.t>UMnPRTl^REPETAHIjRPRGPCYOEVAGPRE 
ALWU^RBLCCQWLQPEAIISKEQMLEMIjVIiKQFI/yriiP PE IQAWV 
RGQRPGS PEEAAAL VEGIiQHDP *ARMPS PLGPPCLFVMDPETTL 
EEPETAPJ^PRGFCTQEVAGPREAIiARIiRELCCQWIiQPEAHSKE 
QMLEMLVI,EQFIjGTI>P PEI OAIfVRGQRPGS PEEAAALVEGI«QHD 
PGQLDG 


7062 


71 


744 


" AKAGTNLEJRJLHWI*S YFFCI PKHKLKSSQKDKVRQFMACTQAGER 
TAI YCLTQNEWRIJDEATI^FFQNPDSLHRES KRNAVDKKKIjERb 
YGR YKDPQDENKTGVDGIQQPCDDLSLDPAS I3VLVIAWKFRAA 
TQCEFSRKEFIJXSMTEIiGCDSMEKLKALIjpRliEQ 
PVAVTPTPA KWPfiORC J.DIj *MAGAYWKLVLSGRFKFL YLWNT Fl> 

MBHH 


7063 


2 


562 


I^TVPDLPGRRFRAMRTGQRR*PELPPW4MSLEQAEbtKAJ^iSKK 
LTBYIHCIiQPATGRWPJ^IWSVCTATGAWNWLIDPETQKVSF 
FTSLWNHPFFTtSCITLIGLFFAGIHKRVVAPSIIAARCRTVIiA 
EYNMSCDDTGKLILKPRPH\^*QSSIjIVMGIiKIAFI>RISDTAKS 
HKGFLIjRLDM 


7064 


300 


884 


rdtgsdpsstrrlcstcctgh*paepiasphpsrgtcppassas 

SRRTGCIfrCPPKSGHAQARRSRRASASRWGARGAVRSAVAARGC 

ssragrwletpgrrrgp pacaaaagrlrgpap* aapptasvpar 

CRCPAARTGAPAAATWLRRRLSGLRAPALGRRRSPGPSPKSAAP 
PLLTPLGAGPAGGSRANS 


706$ 


1 


555 


ATTTHSARRSGKGAAAEAAASAAGGRQKGPDRKAWEGRRTTPGG 
RSQSEPKAPPPQKRSBAAFASMAHS PVAVQVPGMQNNIADPEEL 
FTKLERIGKGS FGEVFKG IDNRTQQWAI KI IDLEBAKDBIKDI 
OQEITVLSOCDSSYVTKYYGSYLKGSKLWIIMBYLGGGSALDI^ 
RAGPFDEFQ 


706S 


356 


676 


PGPQRGPWRAREGGHPLDPADHPRAPASIiRSNVRAATMMQI CDT 

YNQKHSLFNAMNRFIGAVNNMIXJTVMVPSIiIxRD 

VGVEVGGSGGCLEERTPP 


7067 


152 


973 


KEN ITMATE IGSP PRFFHM PRFOHQ APRQLFY KRPDFAQQQAMQ 
QLTFDGKRMRKAVNRKTIDYfn°SVIKYLENRIKQRIX3RDMRAIQ 
PTJAGYYNDLVPPIGMliNNPMNAVTTK^ 

TPEGRRLVTGAS SGBFTLWNGLTFNFETILQAHDS PVRAMTWSH 
NDMWMLTABHGGYVKYKQSNMNNVKMFC^\HKEAIREARFIHNI P 
FSVVPIVMVK&FSKCILO»BMHGI^ 
H3PFCHAPF 


7068 


222 


816 


DTMKEWLLIiFLAliCSAKPFFSPSHIJ^KN^ 
DDDDDDDDDDBmSI^PTREPRSHFFPFDLFPMCPFGCQCYSRV 
VHCSDI^LTSVPTNIPFDTPJflKDLQNNKIKEIKEWDFKGLTSLY 
GLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEI PLNLPKSL 
AELRIHENKVKKIQKDTFKKK 


7069 


1147 


1765 


FRDHRRYFYV^SGESQWEFPIXSEEBEBESOAQEl^ETLAKQ 
TTMKTGTDSNST^SETSTGSLCKESFSGQVSSSSIjHPLTPFW 
T1MSNVPVLQPPLPLEMPPPPPPPPESPPPPPPPPPAPKMPPP 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AsAlanine, C=Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, F~ phenyl alanine , G=Glycine, 
HsHlstidine, I»Isoleucine, K-Lysine, 
L=Leucine , M=Methionine, N-Asparagine , 
Paproline, Q=Glut amine, R=Arginine, 
S~Serine, T-Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KKTKKGRKDKAKKSKTKMPSLVKKWQSlQREIiDEEDNS SS SEED 
RVSTAQKRIEEWKXX3QLVSGMABRNANFEA 


7070 


1 


547 


DGTMEDSEAVQRATALI EQRIjAQEBENEKI»RGDAHQKI»PMDLIjV 

LEDKKHHGAQS AALQKVKGQERVRKTSIiDLRREI I DVGG 1QHL I 

ELRKXI^QKXRDAIAASHEPPPEPBEITGPVDEETFLKAAVEGK 

MKVIEKFIAI)GGSADTCDQFIUITALHRASLEGHME 

ATVDFQ 


V&71 


2 


921 


ARGTLRAI^AKJCVGKVGANGQKAAGPSAOSV-rENKIGS PPKTP 
VSNVAATSAGPSNVGTELNSVPQKSSPFLTRVPAYPPHSEKflQY 
FODPRTOIPFPVPOYPfyTGYYPPP PTVPAGVAPCVDPT7VP QMKTlf 
PESSLPPASMPYADHYSTFSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGM YAP V YDSRRIWRP PMYQKDDI I RSNSLPPMDVMHSS VYQT 
SI^ERYlKIJXSYYSVACQPPSEPRTTVPrj^POTHLKTSCEEQ 


7072 


2 


921 


ARGT1»RALBTAKXVGKVGANGQKAAGPSADSVTENKIGSPP 
VSNVAATSAGPSNVGTELWSVPOKSSPPLl'RVPAYPPHSENIQY 
FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRPVRSNNV 
PESSIjPPASMPYADHYSTPSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGMYAP WDSRRIWPJJPMYQRIJDI IRSNSLPP>TOVMHSS VYO^ 

IPJUCPDQWAQYH^KAPLVSSTLPVATQSPTPPSTLNRGEGS 


7073 


50 


S04 


LAHGSFGVSDFPAPAAAPAHTLTSFSGSIiSPOFRKPIiGRAPAMP 
LVR YRJCWI LGYROrcKTSIaAHQFVEGEFSEGYDPTVKNTYS KI 
VTIiGKDKFHI^r.VDTAGrin'EYSTTiPYSPI TnVHGYVr»W«?VTQT- 

« lUwiu/iiC muiu w U IrttjyL/A x oinr x or A x^jvxivj X vxiv XOV X h>u 

HSFQVIESbYQKIiHEGHGK 


7074 


263 


1003 


VCPVLCSTRO^PGHSSLVTYFGKPTRRKSFLLGHCrAAGKMNIS 
VDLETNYAELVLDVGRVTLGENSRKKMKDCKLR1CKQNERVSRAM 
CALLNSGGG VI KAE I ENED YS YTKDGIGLtDLENS FSK ILLFVPE 
YLDFMQNGNYFTiIFVTCSWSLNTSGLRIT 

MMATAALE FliKDMKKTRGRL YIjPJPEIJLAKRPRVD I QEENNMKAL 
AGVFFDRTELDRKEKLTFTESTHVEI 


7075 


598 


1005 


NYXNFPFRKEYPPHVQKVEINPVPXSPJLQGVERIMKJCTEBS 
VKPEI KRIG/QQKRHGSTYQPTP PLSPAS KKdiTHLBDIjQRNCRQ 
AITLNESTGPLLRTS IHOJrfSGGQKSQNTGLTTKKFYGNNVBKVP 
IDII 


7076 


279 


1049 


LQSEaSNAAEGNEQRHEDEQRSKRGGWSKGRKRKKPIiRDSNAPK 
SPbTCYVRFMNERREQI^KRPBVPFPEITRMLGNEWSKLPPEE 
KQRYLDEADRDKKR Y MKELEQ YQKTEAYKVPSRKTQDRQKGKSH 
RQDAARQATHDHEKETEVKERSVFDIPI FTEEFLNHSKAREAEL 
RQLRKSNMEFEERKAALQKHVESMRTAVE KLEVDVIQERSRNTV 
I/X)HI^LROyLTSSFASMPLPBXGETPTVl>TIDSYM 


7077 


3 


1119 


SSMGSNSE INGLAIiRKTDKYGFLGGSQYSGSLKSSIPVDVARQR 
ELKWIJ}MPSNWDKWI^RRBX)KVIOjRCRKG IPSSLRAKAWQYIiSN 
SKELI^OJ*PRKFEELBPJ^DPKWLDVIEKDLHRQFPFlffiWPAA 
RGGHGGQBL YRILKAYT I YR PDEG YCOAQAP VAAVLJLMHMPAEO 
AFWCLVQICDKYLPGYYSAGLEAIQLDGEI FFALLRRASPLAHR 
HLRRQjRIDPVLYMTEWFMCIEARTXJP^ 
IFRVALVIiRHTEiGSVEKliRSCOGMxETME^ 
LVHEVTOTiPVTBALIERENAAQLKKWRETRGEIiQYRPSR 
RAIHEERRRQQPPLGPSSS 


7078 


483 


767 


FQGQRMAGEQKPSSNI.LEQFILLAKGTSGSALTALISQVLEAPG ' 
VYVFGELLEIJ\NVQEIiAEGANAAYLQIiLNLFAYCT IANKE 
SLPELY 


7079 


2 


376 


SWEPKRPKBPSGSDGESDGPIDVGQEGQLSQMARPLSTPSSSQ 
MQARKKRRG 1 1 EKRRRI>R FNSS LSBLRRL VPTAFEKQGSS KLEK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to firat 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^AJLanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, P=Phenylalanine, G-Glycine , 
H=Histidine, I*»isoleucine, K= Lysine, 
L^Leucine, M=Methionine , K=Asparagine, 
P=Proline, Q=Glutamine, R*=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan , Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








AKVIjQMTVDHLKMIiHATGGTGTHALLFQAS fiqqi f 


7080 


200 


595 


VQLPLBAPCLSLLSCRDHSGGNRDLSRRHRDCRVYGSPQDGIPY 
LTHPLCHQD\TVSV(^QII^IATPGHTQGHLVyi^nGBPyKGPS 
CLFSGDIiFI>SGCGEFPRKRBEI^BEGKTEVRAATVPWHALKP " 


/Ua± 


1 on 


1 "506 '" 


1 auTropMTT.wQr.cr^'VWJirr.f na PMVPvttn^.PMOT.T.ATJWGADi 
VYCEELIDLKMIQCKRVVNBVI*STWFVAPDDRVVFRTCKREQN 
RWFQMGTS 


7082 


3 


1137 


APSRimilJ^WCRGPVIiLCLRQGLGTNS FLHGLGQK P FEGARS L 

SSLENLRLPrLRKESSPRELKDSSGDQGRCGPTHQGSEDPSMLS 
OAQSATEVEERHVSPSCSTSRERPFQAGBLIIjAETQEGBTKFKK 
LFRLNNFGLIiNSNWGAVPFaKI VGKPPGQ I LRSS FGKQYMLRRP 

ALiliJJ I V VJjWJtXWJlrVJL 1 C i- it ) *J J. JJO Finis -LXT r ± V XJJ^rt\j<J\J-J'J 

GMSIjFIiSKAVGS qgrvi s fewkdhhdlakknykhwrdswklsh 
vee wdnvdfihkdisgatedi ksiitraavaldmlnphvtlpvf 
ypkxkhggvcpvyvvnitqviei.ld 


7083 


115 


541 


RSNAVQLTRMEYAMl^l^IaLYPKSLSRHVSVRTSWTQQIjLSBP 
SPKAPRARPCRVSTADRSVRKGIMAYSLBDIJjIxK^ 
PFFLVLEEDG'ri' V ETEB YFQAXjAGPTVFIWIjQKGQKWQPPSEQG 
TRHPLSL3HK 


7084 


3 


522 


NSVSVSSQSRFLASVPG1X5VQRSAAADMAASTAAGKQRIPKVAK 
VKNKAPAEVQITT^iliLREAKERKIiKIJoPPPPQQKITDEEEIjND 

RALD\mYPJ^ITLWLKYAEMEMKNRQVNHARNIWDRAlTTL 


7085 


243 


1499 


rqiaiu^rrrgwrspfggapmahitinqylqqvyeaidsrdgasc 

yavgnhdf i eaykoqtv i vqs flrafqahkeknw aip vmyaval 

dlrvfannaik»lvkkgkskv^^ 

ag iedskkwgmiffi»vnql»fki yfki nklhi*ckp1*iraidssnlk 

dotstaqrvtykyyvgpjcamfdsdfkqaeeyi^fafehchrssq 

KNKRMILIY1UjPVKMLLGHMPTVBL1»KKYHLM^ 

N1VLLIJIEAT^KHEAFFIRCGIFX.IT J EKLKIITYROT 

KTHQLS LDAFLVALKFWQVEDVD I DBVQCIIiAKLI YMGHVKGY I 

SHQHQKLWSKQNPFPPLSTGC 


7086 


256 


525 


ILAARMGKQNSKLRPE^W(H)JjI^ 

SGHl^?lEEFKKIYGNFFPYGDASKFAEHVFRTFT)ANGt)GTIDFR 
EF 


7087 


166 


723 


LSGS SAG KVAAP CVPP SNHELVP X TTEN APKNVVDKGEGASRGG 

NTRKSI^EDNGSraVTPSVQPHI^PIRNMSVSRTHEDSCELDLVY 

VTERI IAVSFPSTANEENFRSNLRBTJA^ 

ERRPDITKLKAKVLETCWPDLHTPALEKJC^ 

HRCRVLKNKG 


7088 


104 


759 


GTSAAS PSSLLEMAGK J TKTGKLYS S YVGLVYMFNLI VGTGALT 
MPKAFATAGWLVSLVl^VFl^FMSFOTTTFVIEAMAAANAQr™ 
KRMR1^IJ(EBEDDDSS17VSDSDV1>IR1)ITYERABKRPXLSVQRRGS 
PNPFEITDRVEMGQMASMFFNKVGVNLFYFCI IVYLYGDLAI ya 
/AVPFSLMO^rrCSATGNMCGVEADTKYmOT 


7089 


33 


1775 


S VCWEDRYLKARME ES V LSRAPSRGGVNFLWVART Y I PNTKVEC 

HYT^PPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 

SPIHTSVQFOASYLPKPGAQLYQFRYVNRGGQVaSQSPPFQFRE 

PRPMDELVTLEEAIX^DILLVVPKATVLQNQl^ESO^ERNDl^ 

QLKl^l^GQVTEljRSRVQELERAl^TARQEHTELMEQYKGISRS 

HGEITEERDILSRCK^DHVARILBLEDDIOTISEKVLTKEVB^ 

RLRDTVKALTREQEKliLGOLlCErVQADKEOSE 

Nl^IiKEAKSWQEEQSAQAQRIiKDKVAGMKDTLGQAQQRVABLEP 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
( A-Alanine , C=Cyateine, D-Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Hxstidine, I~Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, ^Threonine, V*= Valine, 
W=Tryptophan, Y«Tyrosine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 








I^QbRGAQELAASSQQKATI^EEMSAAAARDRTlAELHRSR 

LBVAEVlTOKLAEIXaflLKBBKCQWSKERAGLL^ 

I*SAB ILRJjE KAVQEERTQNQVFKTEIARKKD SSLVQLS ES KRElB 

TRLRSAlAVI^KBiaMLQE^EM^YMR^ 

EnATTEDEEAAVGLSCPAALTDSEDESPEDMRIiHPMAFVSVBTQ 

ASLLLGLE 


7090 


33 


1775 


s vcwedr yiikarmees pls rap5rgg vnflnvarty i pntfcvec 

hytlppgtmpsasdwigifkveaacvrdyhtfvwssvpbsxtdg 

spihtsvqfqasylpkpgaqlyqpryvnrqgqvcgqsppfqprk 

prp^riwtlbi^ux^sdillwpka^ 

qijo^legqvtelrsrvqeiaeraiatarqrhtelmeqy^ 

hgeiteerdilsrqqgi)hvarilelbddic^isbkvi.tkevbld 

RLRDTVKi^TREQEKliIiGQtiKEVQADKEQS EAE3LQVAQQENHHL 
NLDIiKBAKSWQEEQSAQT^RLKDKVAQ^ 

LKBQLRGAQEIiAASSQQKATLLGBBIiASAAAARDRTIAEIiHRSR 

LSAEILRLEKAVQBBRTQNQVFKTELAREKDSSLVQLSESKREL 
TELRSALRVLQKEKEQLQEEKQEIJjEYMRKLRARLEKVADEKWN 
EnATTEDKBA^VGr^CPAALTDSEDESPEDMRLHPMAPVSVElX) 
ASIALGLE 


7091 


186 


1076 


EGMLTKEI IRCGRSEEQBIiBPWPS PKKARSGRWIiRNGFKRKMBEP 
EEPAD3GQSI.VPVYIYSPEYVSMCDSLAKIPKRASMVHSLIEAY 
ALHKQmrVKPKmSMBE^TFH-rDAYIKJRXQKVSQEGDDDHPD 
S I E YGLGYDCPATEG I FDYAAAI GGATI TAAQCLI DGMCKVAIN 
WSGGWHHAKKDEASGFCYLNDAVLG ILRLRRKFERIIiYVDLDI*H 
HGDGVEDAFS FTS KVMTVS LHKFS PGPFPGTGDVSDVGLGKGR Y 
YSVNVPIQDGIODEKYYQICERYEPPAPNPGL 


7092 


522 


BQ9 


KQGINEDQEBSQKPRK3EGCEPI3KRQMKKL I KQKQWBEQREtiR 

KQKRKEKRKl?K3CIiERQCQMEPNSIX?HDRJCRVRRJ)W 

DCSFDXLM 


7093 


454 


655 


NFGVSGVELAQQASMVRMS FVIAACQLVLGLLMTSLTBSS IQNS 
BCPQLCVCBIRPWFTPQSTYRBA 


7094 


2 


508 


FVRSMHWG VG FAS S RPCWDLS WNQS I S FFGWWAGS E BP PSFYG 
DIIAFPLG^YGGIMAGI^SDPWWKKTLYLTGGAI*IAAAAYI>I>HB 
LLVIRKQQEIDS KDAI ILHQPARPNNGVPSLS PFCLKMETYLRM 
ADLPYG^YFGGKLSAQGKMPWIEYNHEKVSGTBFI I 


7095 


i ; 


411 


IASSLPKMASI^SDP^YI^VQGEKKVRAPI^QLYFCRYCSELR 
SLECVSIDSVDSHYCPSC^BNM^ 

TLSTRATS ISTQLPDDPAKTTMKKAYYl»ACG FCRWTSRDVGMAD 
KSVGR 


7096 


224 


2067 


BTRSIAVQBKPSQAGPjmSSRISFAGALFLTRFLIiQEIibLNNFC 

SAMSPAPDAAPAPAS ISLFDLSADAPVFQGLSLVSHAPGEAIiAR 

APRTSCSGSGERESPERKUWPT^ISEKl^CSTCIXyrFQNHQE 

Q RE H Y KLDWHR FNLKQRLKDKPLLSALD FE KQS S TGDLSS I SGS 

BDSDSASEEDLQTLDRRRATPEKZ*SRPPGFYPHRVTL^^ 

LYAYRO/I^PHQDPPEEAELUaQNI^S^ 

GAI FC^REWTHICrFHRYTVRAKRGl7\QGLRDARGGPSH3A^ 

LRR YNBAIT^YKDVRDLLAGPS WAKALBBAGT I LLRAPRSGRS L F 

FGGRGAPLQRGDPRLWDI platrrptfobi^rvlhklttlhvye 

EDPREAVRliHSPO/rHWKTVREKRKKPTEEEIRKICliDEKEAIjGQ 
NEESPKQGSGSEGEIKjFQVELELVELTVGTLDLCESEVIiPKRRR 

rkrnkkkksrdqeagahrtllqqtqeeepstqssqavaaplgpl 
udeaiwgqpelwnaijjvacragdvgvlki^i^spadprvi^^ 

LSAPLGSGGFTLLHAAAAAGRGSVVR1^EAGADPTVD<^DH 


7097 


256 


122B 


IRTKSAA*TWBAWPQCX3REGSRI1TBPC3BANA^ 
FLAAQGDQAFHSGIiETNNSNS ELPLRVGLKVAQGS PLMGGQVSA 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A« Alanine, (^Cysteine, D-Aspartic Acid, E*= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H»Histidine, 2>Isoleucine, K^fcysine, 
lj=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyro9ine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
V=possible nucleotide insertion) 








SNSPS£UiHCRNANEIWMSALCPRLWDVPIiHHLSIPGSHPTMTYC 

IJTKKSPISHEESRLIiQIiIjNKALPCITRPVVLKW 

IiDAGVRYIiDIAIAHMLI&SEKNI^ 

WLERHPREWILACBNFBGLSKDLHEYLVACIKMI FGDMLCPRG 
BVPTI^LWSRGQQVIVSyEDBSSIiimHHBLWPGVPYWWGNRVK 
TKAliXRYLBTMKSCGR 


7098 


82 


956 


SSFliKRCRKVjbGCWGIPSEQSLFSTTiEE^RDKEIDNYCVMRLQT 
HARSGFWAPNRPPWICRMTAVDGDRGGSSRETCRCHFHPSriKA 
LVIiliLQDWQPGGVGICTSFI^ISWALLDYHRALRrCLPSKPLLG 
LGSSVIYPLvm*LIiWPRVl^VALPSJUjFPSWAI^Fl^LWLVL 
IJ,WVW1^TDF^PDPSSEWLYHOTVATILYFSWF1^AKGRTRGR 
AI IHFAFLLSDSILLVATWVTHSSWLPSGIPLQLWLPVGCGCFP 
IXnJu^RLVYYm*LHPSCCWKPDPX>QVD 


7099 


992 


210 


LFPJjAPGPLRSI»ARQGYHQIWAFPFLPSGATATWPAASRSRSIA 
ARSIiPRSPARPGPNDAIilX5EHDFRGQGVR7iQRFRFSBBPGPGAD 
GAVIiEVHVPQIGAGVSIiPGrbAAKCGAEVII^SnSSBLPHCLBVC 
RQSCQMNW^Hl^WG^TWGHISWDI^Uu^PPODIIIJ^DVPFBP 
KDFKDILATIYFLMHKNPKVQLWSTYQVRSADWSLEAIJiYKWr^ 
KCVHI PLES FDADKKDIAESTLPGRHTVEMIjVI S FAKDS 1> 


7100 


205 


671 


ANGGFWEAAPGSBVSLPLWVPTASHSKTTALGIGSAPPPHLSVIi 
FliFSPPPQLGDPIiEAFPVFlOCYDRNGUWSXECKRVSGIiBPATV 
DWAFDLTKTTWQTMYEQS EWGWKDRE KREEMTDDRAWYLI AWEN 
SSVPVAFSHFRFDVBRGDEVLYW 


7101 


2 


503 


WRGGPRRAKRIiAGGAVGWVIJjVRGVHSVRAGGGRPPPJVADMKKD 
VR1I»I»VGEPRVGKTSL1MSI>VSEEFPKEVPPPJVBEITIPADVTP 
KRVPTIirVDYSEAEQSDRQLHQE ISQANVICIVYAVNNKHS IDK 
VTSRW I PLlNERTDKD3PJUPIiIljGGNKSDLVBYSR 


7102 


2 


503 


WRGGPRRAKRLAGGAVGWLLVRGVHSVRAGGGRPPPJUUlh5KJ^ 
VRILI#VGEPRVGKTSLIMSI»VSBEFPEEVPPRAEEITI PADVTP 
ERVPTHIVDYSBAEQSDEQLHQEISQANVICIVYAVNNKHSrDK 
VTSRW I PL INERTDKDSRLPLILGGNKSDLVEYSR 


7103 


ii£ " 


438 


GSQSSVAVNIRSGTDBESMDIMKGQASSVNIAATASEKSSS3BS 
r^DKGSEIJCKSPDAWFITVI.KVTPBEYAGQITLMDVPVFKAIQP 
DELSSCGWHKKEKYSSAP 


7104 


1670 


795 


RLWKHRSVSAGASGWGLSS PGCrJjLHPSbPBEERVDI LIKNAGV 
MRCPHWTTEIXjFEMQFGVNHIjGEAWAGAAPWVQAII*PRRPPKVL 
GF*V* VKSDUFI IIJIPGHFIiLTBn^JJ>iaao^APSRlI»I^SIA 
HVAGH IDFDDLNWG^TRKYHTKAAYCQS \KLAIVLFTKBLSRRLQ 
GSGVTVNALHPGTVARTELGRHTGIIIGS TFLQHHN\ WAIILLAAWS 
KS PRSWPAPAQmmiAVAEEl^\ V1SGKYPI)GLKQKAPAPEAED 
BEVARRLWAESARLVGLEAPSVRBQPLPR 


7105 


765 


143 


GQMCRRPSPKSTSCLSMTCDLP/RGLODPQCLALFRVAVUKHQA 
IiKAAMSGOGVDRHL&'ALY I VSRFT.HLQSPFLTQVHSEQWQLST 
SQIPvQQHHI*FDVHNYPDYVS SGGGFGPADDHG YG v S Yl rMGDG 
H ITFH I S SKKSS TKTDSHRLGQHIEDALLDVAS L FQAGQHPKRR 
FRGSGKENSRHRCGFLSRQTGASKASMTSTDF 


7106 


14" ' 


1064 


GIiQAGHPHPRSASRIPBADTH \ YSKLQPJVPDSIVNKDHKRMFGT 

YFRVGFFGSKFGDLDEQEFVYKEPAITKLPEISHRLEAFYGQCF 

GAEFVEVIKDSTPVDKTKLDPNKAYIQITFVEPYFDEYEMKDRV 

TYFEJ(NFNLRRFWYTTPB~rLEGRPP^FXHEQ^ 

PYI KTR I S VI QKEE FVLTP IEVAlEDMKKXTTiQLAVAXNQEP PD 

AKMIJQ^fVLQGSVG^TVNQGPLEV7^VFIJAE I PADPKLYRHHNKI* 

RLCFKEFIMRCGRAVEKNKRLITADQREYO£^ 

RPMIERKI PEI*YKP IFRVESQKRPSFHRSS FRKCETQLSQGS 


7107 


1145 


591 


*I*WI»OTGKKK 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corxe sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cyeteine, J>=^Aspartic Acid, E» 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lyslne, 
L=Leucine . M=Methionine, N=>Asparagine, 
P^Proline, Q=Glut amine, R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X-Onknown, *=»Stop 
Codon, /^possible nucleotide deletion,. 
\=pos3ible nucleotide insertion) 


7108 


1 


942 


VKVAI^LTNJ^QPRT£^BWENSFTLKMFLFQFVNI^SSTFYIAF 
FWSRFTGHPGAYLRLlNRiraiiEECHPSGCLIDLCKQMGI IMVLK 
QTWNNFMEIX5YPLIQNWWTRRKVRQEHGPBRKI SFPQWEKDY^ 
0PMHAYGLFDEYLQ4ILOFGPTTI PWAAFPIA.PT .T.AT.T.WWT tt?t 

RLDAiTCFVTQWRRPLASRAKDIGIWYGIIJSGIGIIiSVITNAFVI 
AITCDFIPRLVYAxTCYGPCAGQGEAGQK^^ 

DEENRSBPBSlXJSEFSGTPLKYCRYRDYRDPPHSLVPYGxTLQP 
WHVLAW 


7109 


964 


102 


TQPFPKSEQVYUun^\nnrEIX^EPKD^ 

SE KXiATDTSTFEATSEGTLEL^QRN P KAKRLRWS PAQEKS FRQM 
WIHKBIPTGKKDHEC^ECGKTFIYNSHLVVHQRVHSGBKPYKC 

QRIHSGEKPYECNBCGKAFSQSSYLSQHRRIHSGEKPFICKECX3 
KAYGWC3ELIRHRRVHARKBPSH 


7110 


96 


697 


RT.TlW TyQfSPT.V RU'l 'XCVTfX>XX n/VDT.Vrro vp r inrnMT ■pdkct'pdiit r»" 
nuun r aor u v c v x JMaJSiut x v Mr u I l/KXKLwAunitlKAox J^VIAv 

SPSTKRRGQMLQPI IEGETAHFFEB I KEEEEDGVNLSSELGDML 

KTAVQVQSSLKNSESDVERTtfQRKLAI^ 

LWKAIUUlKKKLRiCTIjIffiFEEAFY^ 

iTOCIKAKLRJ^LBVLISKQDSSKSl 


7111 


2 


414 


G3GLYRGPTPGGQCIWK^NSMPPDHERNFt3F^ 
LKRSLPSTiyiTUjRPDQRYLEEX^ 

GAVLW 


7112 


103 


495 


PRCFPVADRGRLIGGLPDWTIMEGKTLNLTCTVFGNPDpKVIW 
FKNDQDIQ^SEHFS VKVEQtfVKYVSMTI KGVTSEDSGKYS INI KN 
KYGGEKIDVTV3VYKRGEKtPimPPO^Aia?KLIPASASAAGQ 


7113 


1 


824 


KCI^OAWHEAPSSIAFTRWCSREERAEGGGNIiHRS ITRDPKPPG - 
I^PSQRPMDDKKKK^PKPCLAQPAQAPGTLRRVPVPTSHSGSI^ 
ALGLPHr,pSPKQRAKFKRVGKKKOTP V1AGGGSGSAGTPLQHS F 
LTEVTDVYEMEGGLLNU^NDraSGRIjO^ 








DQIJ>SNLGSCLGALVPGGMRGGEGTY S QSHS WALG K KVG VHG^ K 
SSGPLNLPRR 


7114 


3 


1492 


VWEVDBOIDHYKESODKFIiWOAAPIGKETLKni^RORrKT r*» K T 

I YLHTD FVS VKQRL pkyys wercskhhlnflgqnrs ywkkddg 

CKAYWKVCLHYNLHKAQPAERFFDPNQRGKW 
QTGEKLYKCTRCG1CVFT.QKM[LVW 

KPWCDKCPKAPKSSYHLIRHEKTHIROAFYICGIKCTTSSLIYQ 
RIHTSBKPQ^SBHGKACDEKPSPTKHWRITI^ 
FRGKSHLSVHQRIHTGB KP YECS I CGKTFSGKSHI^VITHRTHTG 
EKPraCRJ*CX3KAFGEKSTLIVHQRMHTO 

S PL I KHQRIHTGERPY ECTD CKKAFSRKSTIjIKHQR I HTGEKP Y 
KCSECGKAFSVKSTLIVimTHTGEKPYEaUXrGKAPSGKSTLI 
XHQRSHTGDKNL 


7115 


1 


947 


NAAHG YN WGLWCMY 1 1 PPQDWLDRGDESAP I RTPAM I GCS FWD 

REYFGDIGLLDPGMEVYGGElTVTnjGMRW 

HlWTRKPYlTODxOTlAKRMAIdtoA^ 

SNPGVDFGDVS ERIiALRQRLKCRS FKWYLENVYPEMRVYNNTLT 
YGEVRNS KASAYCLDQGAEW3DRAILYPCHGMSSQLVRYSADGL 
LO^PI^TAPLPDSKCLVDDGTGRMPTLKKCEDVARPTC^WD 
PTO^GPWSP>ATGRCLEVEMSKDANFGLRLWQRCS GQKWM IRN 
HIKHARH 


7116 


866 


95 


RVRMRRHAEVIEEKIiSMKS WAKFR PGEPWKGYPNIDPETDP YVT 
PGSVTNNIiS IimniEVDHLRDRNSGSSSSLKTTLPSTSAWSSIR 
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SEQ 
ID 
NO: 


Predicted 
beginning 

location 
corresponding 
to first 
amino acid 

1C31UUC Ul> 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino auJiu 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, O-Cysteine , D=Aspartic Acid, B=* 
Glutamic Acid, F^Phenylalanine, G=Glycine r 
H-Hietidine, I^Isoleucine, K«Lysine, 
L=Leucine, M=Methianine, N-Asparagine , 
P=Proline, Q=Glutainine , R=Arginine, 
S=Serine, T-Threonine, V=Valine, 
W-Tryptophan, Y=Tynosine, X-Unknovm , *-Stop 
Codan, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 








ASNYNVPI^STAQSTSARNSDSKDTWSPGSVTHTSIiAHELWKW 
LPPKNITAPSRPPPGLTGQKPPLSTWDNSPLRIGGGWGKSDARY 
TPGSSWGBSSSGRIT1WLVLKNLTPQIDGSTLRTLCMQHGPLIT 
FIILNI^HGKflJjVKYSSKEEVVTakQKSr^ISDLFI^TL 


7117 


695 


1261 


LLI STPGGCHPPPSS IKFTYTGAWGKALPAPHMPCAPGALPQGA 
PVSQAARAIPLLQPSQAAQAEGI^QPARACX5ALCSLPWPLRNWG 
SPIUiLPGGIOTPTm>RKTR^ 

GKVCLRHPRPTGVRGGPGAAGRQGGMGTRRRGTFTSGARDPGGL 
RVKHRCQPTGHLP 


711B 


49 


1863 


PHCEPNPGAGAM^LHVLFEHAVGYALIALKKVKKISLLQPQVK 

BSVLNLGKFHS IVRLVAFCPPASSQVAIJSNAKAVSBGVVHBDIiR 

LLLBTHLPS KKKKVLLG VGDPKI GAAIQF^LG YNCQTGGVI AB I 

I^GVRIJiPHNLVKGXTDI^C3CAQI>3LGllSYSRAK^ 

NMI IQS ISLIiDQLDKD INTFSMRVREWYGYH FPELVKI INDNAT 

YCRLAQFIGNRIIBIiNHDKIjBK^ 

MDISAIDLINIl^PSSRWSIiSKYRQSnHTYLRSKMSQVAPSIjS 
ALIGEAVGARLIAHAGS LTNbAK YPASTVQ I LGAEKALFRALKT 
RGNTPKYGLI FHSTPIGRAAAKNK6RISRYLANKCSIASRIDCF 
SBVPTSVFGEKIiREQVRERLSFYETGEIPRJarLDVMKHAMVG^E 
EAAAEITRiaEKQBKKRI.KEEKKRIJVAX*AI^SKMSSSTPEECT 
EMSEKPKKKKKQKPQEVPQKNGMBDPSISFSKPKKKKSFSKEKI» 
MS3DLE6XAG STS I P KRKXSTOKEBTViroPEEACKniSGSKKKRK 
FSKEEPVSSGPEEAAGKSSSKKKKKFHKASQED 


7119 


49 


1863 


PHCEPNPGAGAMV1jIiHVI»FBHAVCT 

ESVLNLGJCFHS IVRLVAFCP FAS SQVALJSNANAVSEGVVHBDLR 

I>t»LETHI*PSKZKK\n^I/jVGDPKIGAAlQEEIjGW 

LRGVRI*HFHNLVKGLTDI.SACKAQ^ 

NMIIQSISLLDQIiDKDINTFSMRVREWYGYHPPELVKIINDNAT 
YCRIiAQEIGNRREr.NBDKLEKLEEIjTMDGAKAKAirjnASRSSiyra 
MD I SA1DL IH IES FS SRWSI^E YRQSUfTYLRS KMSQVAPS hS 
AltlGEAVGARL lAHAGSLTNLAKYPASTVQIU^EKAJUFTlAIJCT 
RGNTPKYGLIFHSTFIGRAAAKNKGRISRYIJVNKCSIASRIDCF 
SEVPTSVFGEKJjREQVKERIjS FYETGE I PRKNIiDVMKEAMVQAE 

EMSRKPKKKKKQKPQEVPQENGMEDPS ISPSKPKKIOCSFSKEEL 
MSSDLEETAGSTS I PKRKKSTPKEBTVHDPBBAGHRSGSKKKRK 
FSKBEPVSSGPEEAAGKSS3KXKKKFHKASQED 


7120 


1991 


64 


QW5TRRCXRGDKVTNftMQDFhVTNra 

NSTTPL I PVIjS PGTDPAADL YKFAEEMKFSKKI^A rSLGQGQGP 
RAEAMMRSS I E RGKHVFFQNCZHLT^PS WMPALERL I EH1NPDKVH 

RDFRLWLTSLPSNKFPVS ilqngskmtiepprgvramllks yss 

LGEDFI^SC^KVMEFKSLLLSLCLFHGNAI^RRKFGPUSFNI PY 
E PTDGDLR1 CIS QLKMFI^EYDDI PYKVLKYTAGE INYGGRVTD 
DWDRRCIMNILEDFYNPDVLSPEHSYSASGIYHQIPPTYDLHGY 
LSYIKSLPLNDMPEI FGLHDNANITFAQNETFALIiGTI IQLQPK 
SSSAGS CX5REBI VEDVTQNIIiLKVPEPIlCUjmWAKYPVLYEES 
I^TVXVQKVIRYIJRLLQV^ 

AELYNNTVPEXWSAKAYPSLKPLSSWVMDLI^RiaJFU^AWIQDG 
IPAVFWISGPPFPOAPLTGTI^NFARKFVISIDTISFDFKVMFE 
APSELTQRPQVGCTIHGIiFLEGARWDPKAFOLAESQPKELYTEM 
AVI WLL PTPNRKAQDQDFYLCP I YKTLTRAGTLS TTGHSTNYVT 
AVE I PTHQPQRHW I KRGVAI, I CAIjDY 


7121 


2 


546 


RPLRPWVI^I^MVGLMTYGPJ^FQSIJmmRil PPFREASAK 
LTTLVDADAEAFTAYLBAT^PKOTP 

SVPLTLAETVASLWPALQELAR(^NIACRSDLQVAAKALE^ 
GAYFNVLINLRDITDRAFK1X>IHHRVSSL^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first . 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C^Cysteine, D^Aspartic Acid, E~ 
Glutamic Acid, FePbenylalanine, G=Glycine, 
H=Histidine, I=Xsoleucine, K=Lysine„ 
I»=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Qs=Glut amine, RwArginine, 
ScSerine, T=Threonine, V^Valine, 
W-Tryptophan , Y^Tyroaine, X-Onknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=poesible nucleotide insertion) 








ETRQK 


7122 


2 


546 


RPIJlPWVI^I^SMVGLMTyGRRQFOSLDTTMRRLIPPPREASAK 

LTTLVDADAEAFTAYI^AMRLPKNTPKBKDRRTARLQEGIJIRAV 

SVPI>Tl^TVASLWPAl>QEI^CX3mACR5DLQVAAKALEMGVF 

GAYFNVLINLRDITDfi^K^ 

ETRQE 


7123 


1 


1092 


KPAVPEARSAGTSEAGRSGAEEVS CGSVSGDGAAMRLTPRALC3 

AAQAAWRENFPLCGRDVARWFPGHMAKG 

VHDARlPLSGRNPIiFQETLGLKPHIXVLNK>© 

HLBGBGU(KVIFTNCVKDBNVKQIIPMVTEI«IGRSHRYHRKENL 

BYCIMVIGVPNVGKSSLIN6LRRQHLRKGKATRVGGBPGITRAV 

MSKIQVSERPLMFIiDTPGVIAPRIBSVBTGLKI^CGTVLDHL 

VGBETMADYIjLYTLNKHQRPGYVQHYGIXjSACDNVKRVIjK 

KLGKTQKVKVLTGTGNVNVIQ 

LDLDVXiRGHPRV 


7124 


2 


3B2 


LPLTLHJUVPFAHLId^ 

ANSGLQLLGYFIALGGWVGI IASTALPQWKQSSYAGDASIQLR8 
KVFVLKSEWGGDSLGLPRDGGWSCLLHSAVRSBKGFWS 


712S 


166 


1127 


HCISBKR1WSFSMQKGKGRTSRIRRRKLCGSSESRGVNESHKSE 
FIELRKWIjKARKFQDSNIJ^PACFPGTGRGIJ^ISQTSIjQEGQMI is 
LPKSCLLT\RimriRSyiiGAYITKHKPPPSPLLAI>CTFLVSEKH 
AGHRSbLKAXYI^KI I^KAYTCPVCIiKPEVVNLI.PKSKKAKAEEQ 
RAHVQEFPASSRDFFSSI^PLF7UiAVDSIPSYSAIiWAWan7NT 
PJVVYIi\SPGSGNAFIiQSRT^VQLAPYLDIiLNHSPHVQVK^ 
ETHSYEIRTreRWRKHEKVFICYGPHTDNQRLFLEYGFVSVHNPH 
ACVYVSRGWNQLCS 


7126 


1 


733 


CRDMAAFIVPSPARRC^QKGSLGHLPTQPWIiWAAMSPRGQERGT 
SHSOAREPQRPGR^I^SIiQSSPGTI^QAGTASRRRGCMVQRWV 
QVATGRRAVQV P KGALGIALGETS PGASRGMS&3AGGCWALGWA 
PSPVLPSWLI^GPPPWLSIISDSGTQRPSPRRCPARPSPWGPQC 
WRGGRIASAEASST*rPGSGSRARSGRRSPGSRRRSASAPSPTP 
PTDACA*SCVARPAGSRSSRPAAA 


7127 


1311 


277 


GLPAMCST* KAGYYEBTEGDCI PKDR* IEKRPFKE I * RRIPRI F 
AKQKQ1 * 3 *NSQKIGASEIDRGRKEADCSDAPAAARIGAVSVFR 
RSTQBARVS PRSNAKSANIiRAVRAD* WEHFVLLFHTPRQFIAEC 
ICRST* *K*WHQI»C* PLSSI»*TGIiKRKI.LL* VLFRI * WLKDCDV 
* FOQKI FATNFCNWQNLIQ* EE * KPVE YSVEN* H IMNLLI»PM + L 
OQSSLRDQTlVTWRM*RNYSMFRINM I SSL* DGS IHI PLKT^FY 
PALIFTLTVP.TNSCCQRPT.PLFAIIQS I KTLASSGSPMLACLRFL 
LVKKRAFIHTPRSPGCS V* CXHVLVKDNKNNCVGSBV 


7128 


2 


5228 


GRVDLWTILLGRSAI»RBI>SQIBABLNKH^^ 
SSABXVKWJIQTVASPIiKEriGIiRlSKFIfiLDEEQ 

dypjgtrdsvktvi^derqsqalilkiadyyyeertcii^cvlel 
ltyfqderhpyrveyaixrvdkl.ekelvskyrqqfeblykteapt 
wetiignlmterq vsrwfvqclreqsmllb i iflyyayfemapsd 
llvltkmfkeqgfgsrqtnrhlvdetmdpfvdrigyfsalii»ve 

GMDIESIJlKCAI^DRREJtflQFAQI^ 
APVIJ^WALURHTIOTEBTSSVVREa^ 
SIASGGNDCTTSTACMCVYGLLS FVLTSLKbHTLGNQQDI I1TIA 
CBVLADPSLPEJjFWGTEPTSGIjGI ILDSVCGMFPHI*LSPLLQL1i 
RAL VSGKS TAKKVYS FLDKMS FYNEL YKHKPHDVTSREDGTT»WR 
RQTPKliYPI^G<yrNLRlPQGTVGQVMIJ>DRAYLWWEYSySSW 
TLFTCEIEMLIjHVVSTADVIQECQRVKP I IDLVHKVI STDLSIA 
PCLLPITSRI Y>^I J I^RLTTVISPPVDVX^CV^^TV1 J AARNPA 
KVWTOIiRHTGFXPFVAHPVSSLSQMISAEGMtJAGGYGNL 
QPQGEYGVTIAFLRI* iTTLVKGQr^STQSQGLVPCVMFVIiKEML 
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SEQ 
XD 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alaaine , C= Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F<= Phenylalanine, G=Glycine, 
H=Histidine, I»=Isoleucine, K=Lysine, 
L=Leucine, (^Methionine, N=Asparagine , 
ps=Proline, Q-Glutamine, R=Arginine, 
S^Serine , T= Threonine, V« Valine, 
W«Tryptophan, Y^Tyrooine , X=Unknown # *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PSYHK^YNSHGWEQIGCLlIxBLIHAILNLCHETDLHSSHTPS 
LQFLClCSLAYTEAGQWINIMGlGVIXriDMVMAAQP 
QGQGQLLIKTVTCLAFSVTNNVIRLKPPSNWSPLKQALSQHGM 
GNNLIAVLAKYI YHKHDPALPRLAI QLLKRLATVAPMS VYACLG 

ndatuu:rdafltiu^k\ie\i»irik\vmil\efltva\vetqp 

GLIELFLNLEVKI>3\SDGSKEFSU3MW\SCLHAV/VWEIjIDSQO 

QDRYW CPPLLHRAAI AFLHALWQDRRDS AMLVXJH'KPKFWENIjT 

SPLFGTI^PPSBTSEPSILBTCALIMKIICLEIYYVVKGSLDQP 

IaKDTLKKFSIEKRFAYWSGYVKSIJVVHVABTEGSSCTSI*LEYQM 

LVSAWRMLLI IATTIIADIMHLTDSVVK^QLFLDVLDGTKAIjLI.V 

PASVNCLlO^SMKCTIiLLILLRQWKRELGSVDKTLGPLTEILEG 

VLQADQQLMEKTKAKVFSAFITVLQMKEMKVSDIPQYSQL 

CBTlKJEEVTALFDQTRHSIiAI^SATEDIQ^SMEro 

RPGVCVLGLHIAKBLCE\nDETX3DSWLQVTRRLP I 

SI^KQNLHFTEATLHIJXTI^TQQGATAVAGAGITQSICLPL 

LSWQI^TNGTAQTPSASI^IJWSWPGVYRIiSMSI^EQLLKT 

LRYNFLPFJVLDFVG^QERTIjQCLNAVRTVQS 

FlLCS^FMKBWHmPQIMRDIQVNIiGYIiCQACT^ 

qhylqnkngdglpsav\aqrv\qrppsaasaapssskqpaadtb 

asbqqalhtvqygi»lkilskxim^^ 

e ynflfals fttptfds evapsfgtllatvnvalnhttjgeldkkk 

epltqavgi^tqaegtrtiiksiilmptmencftli»isqamryiir 

PAVHPRDKQRMKQEIiSSEIiSTLIjSSLSRYFRRGAPSS PATGVLP 
SPQGKSTSLSKASPESQEPLIQLVQAFVRHMQR 


7125 


1 


1054 


FRR FR WRRRLH * AGPAS S AGGS PGKAS GTMS GELPPN IN I KEPR 
WIXJSTFIGR^HFFTVTDPRNirJiTNEQLESARXIVHDYRQGIV 
PPGI»TENEI»WRAKYI YDSAPHPDTGEKMIIilGRMSAQVPMNMT I 
TGCMMTFYRTTPAVLFWQWINQS FNAWNYTNRS GDAPI/TVNEL 
GTAYVSATTGAYATALGLNAIjTKHVSPL IGRFVP FAAVAAANC I 
NI PLMRQRELKVGIPVTDENGNRLGESANAAKQAITQWVSRI L 
MAAPGMAIPPFIMNTLEKKAFIJCRFPWMSAPIQVGLVGPCIiVFA 
TPLCCAIjFP QKS SMS VTSIjEAEIiQAKI QE SHPELRRVYFNKGIi 


7130 


2 


780 


HBVPSIiQTSDPLPGSVQRCSVVVSOPNKENWCXJDHLYNSLGRKG 
rSAKSQPYHRSQSSSSVLINKSMDS INYPSDVGKQQIbLSIHRSS 
RCES H<^LLPDlADSHQQGTEKI>SDLTI^DSQKVVVVNRNL PLN 
AQIATQNYFSNFKETDGDEDDYVEIKSBEDESELEIjSHNRRRKS 
DSKFVDADFSDWC^GNTIJISLNSPRTPKKPVNSKLGLSPYLTP 
YNDS DKIjNDYIjWRGPS pnqqnivqsi*rbkfqci*ssssfa 


7131 


805 


573 


AAAEGHIEVViaPIilRACICVNPFAKDRW^^ 
KIJjQDYQDSYTLSETQAEAAAEAIiSKENLESMV 


7132 " 


1420 


1087 


I DMLLLSGALVSGP YTL ITTAVSADIXH'HKSLKXjNAHAIjSTVTA 
1 1 DGTGSVGAALGPIiLAGIjLS PSGWSNVFYPILMFADACABLFL I 
RLIHKELSCPGSATGDQVPFKEQ 


7133 


2 


3648 


QQ J PGIJjPAHGESGDAIiRKPRLQKP I TGHliDDLFFTL YPSLEKF 

BBEIiLEIJWQDHFQEGCGPIiDGGAI^IIiERRI*RVGVHNGLGFVQ 

RPQWVL VPEWDVALTRSAS FSRKWS SSKTS SGS qal vlrsrl 

WjPEMVGHPAFAVIFQIjEYVFSSPAGvPGNAA^ 

MVRWAVWNPIJ^EADSGRVTLPIJ^IQPNPSHCLVYKVPSASMS 

SEEVKQVESGTLRFQFSIiGSEEHUDAPTEPVSGPKVKRRPSRKP 

PTSPSSPPAPVPRVLAAPQNSPVGPGLSISQLAASPRSPTQHCL 

ARPTSQbPRGSQASPAQAQEFPLEAGTSHLEADLSQTSliVLETS 

IABQIjQBLPFTPIjHAPIVVGTQTI^SAGQPSRASMVLIjQSSGFP 

EI IiDANKQ PAEAVS ATEP VTFNPQKEESDCLQSNEMVLQFIAFS 

RVAQDCRGTSWPKTVYFTFQFYRFPPATTPPJUQLVQLDEAGQPS 

SGALTHI IiVPVSRDGTFDAGS PGFQIiR YMVGPGFLKPGBRRCPA 

R YLAVQTLQI DVWDGDS LLL IGSAAVQMKHLLRQGRPAVQASHE 
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SEQ 
ID 
HO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine , C-Cysteine, D=Aspartic Acid r K= 
Glutamic Acid, P^Phenylalanine, G=Glycine, 
HaBistidine, I=Isoleucine, K=Lysine, | 
L=Leucine , ^Methionine, N-Asparagine, 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine # V=Valine, 
W^Tryptophan, Y=Tyrosine, On known, +»Stop 
Codon, /^possible nucleotide deletion, 
\ ^possible nucleotide insertion) 








LEWATKYBQDNMWSGDMLGFGRVKPIGVHSVVKGRLHLTLAN 
VGHPCEQKVRGCSTIiPPSRSRVISNDGASKFSGGSIXTTGSSRR 
KHWQAOKLADVJ^Er^AAMIJjTHARQGKGPQDVSRBSDATRRRK 
LERMR5WRI£RAGGDLGRRGTSVI^^ 

BRT KAES IAS LLSIAITTKHTLHATIX3VABFFEFVI*KKPHNTQH 

TVTVBIDNPELSVIVDSQBWRDFKGAJV3I^^ 

APQ3jY1»RPHBTAHVP FKFQSFSAGQIjAMVQAS PGLSNEKGMDAV 

SPWKSSAVPTKHAKVLFRASGGKPIAVIXIjTVELQPIIVVPQVFR 

BYHPEI*SFX*KKAIRI*PPWHTHPGAPVGMLGEDPPVHVRCSDPirv 

ICBTQNVGPGKPRDI FLKVASGPSPE I KDPFVIJ YSDRWLATPT 

QTWQVYI^SI#RVDVSC^AGQI.TPO^LV^ 

PQELKTDPKGVFVLPPRGVQDLIIVGVRPIiRAGSRFVHLNIjVDVD 

CHQLVASWLVCLCCRQPLI SKAFB IMIAAQKGKGVNKRITYTNP 

YPSPJRTPHQISDHPBI^RFRBDSFQVGGGETYTIGLQFAPSQRV 

GBBEILIYINDHKDKNBEAPCVKVIYQ 


7134 


2115 


1111 


GGKGFSYPPHVGLSI^TPliDPHYVLLEVHYDNPTYKEGLIDNSG 
LRIiFYTMDlRKYDAGVI EAGLWVSIiFHTI PPGMPEFQSKGHCTL 
ECLEEALBAEKPSGIHVFAVI^HftHIJ^RG^ 
I^YDDDFBFNPQEFQYLKEEOTILPGDlJLITECRYNTKDRA»«r 
W^LSTRSEMCLS YI^YYPRINLTRCASI PDlMEQtiQFIGVKEI 
YRPVTTWPFI I KSPKQYKNT>SFMDAMNKFKWTKKEGLSFNKLVIj 
3LPVNVRCSKTDNAEWS iQGMTALPPDIKRPYKAEPIiVCGTSSS 
SSLHRDFS 1NLLVCLLLLS CTfcSTKSI. 


7135 


2 


2072 


fvprvrprslsiiqgpkgesvgsxtqplpssyli fraas esdgrc 
wxid ale la lr cs s w jrlgtckpgrdg e pgts pdas p s s lcglp a 
satvhpdodlfpi^gsslendafsdkserenpeesdtetqdhsr 
ktesgsdqsbtpgapvrrgttyvbo^Qeelgelgbasqvetvse 

BNKSIJWTIMQLRPGMDLSRVVIiPTFVI^PPvSFL^KI^DYYYH 

ADLLSPJVAVEBDAYSRMKLVLRWYLSGFYKKPKGXKKPYNPILG 

BTFRCCWFHPOTDSRTFYIAEO^SHHPPVSAPHVSNRKDGFCIS 

GSITAKSRFYGNSLSALI^KATLTFIWRAEDYTL 

ILYGTWTI J EIiGGKVTIECAKiraFA)AQI»EFKI^FFCGSTS INQI 

SGKITSGBEVIASLSGHWDRDVPIKEEGSGSSALFWTPSGEVRR 

QRLRQHTVpIjiEQTELFiSPJlIiWQHVTPJV^ 

RBAQRQRARERQES LMP WKPQLFHLDP I'rQBWHYRYEDHSPWDP 
I^lAQPEQDGILRTrK»EAVARQTTFLGSPGPPJIERSGPI>QRIi 
RKASlXJPSGHSOATESSGSTPEsCPBI>SDEEQIXn>FVPGGESPC 
PRCRKEARRIjQALHEAI IiS I REAQQBIiHRHLSAMl»S STARAAQA 
PTPGIiLOSPRSWFIiLCVFbACQLFIKHIIiK 


7136 


2 


418 


DFVPSFRRPSGNTSQTVWLIiRAATLEKBVA^ 
SQQRKVRQMIEQLQNSKAVIQSKlATIQEIJaSKIAYIJSABNLEM 
HDRMEHL I BKQI SHGNFSTQARAKTENPGS IRIS KPPS PKPMPV 
IRWET 


7137 


2 


466 


WASGMSTVPGGSRHSI^IC^GGWGVTGGEEBSL^ 
GSFKVATQEPJ*POjlAQ^1RLRRQKKGWPFLGDFliT 
PDDLDGNTNKRS KEVRVLQEMQLLQVAAMN YRLRPLB KFV^*Y FT 
RMEQLSDKESYKLSCQLEPENP 


713B 


2 


466 


WASGMSTVpGGSRHSI^IQVRGGWGVTX^EEF^LTVPVAiyrWQA 
GS FKVATQ BRNPQRAQMRLRRQKKGVVP PXGDFLTKLQRLDSAI 
PDDIJX^NTNKRSKEVRVI^EMQLLQVAAMimUiRPLEKFVTYFT 
RWEQLSDKES YKLS CQIiEPENP 


7139 


1 


357 


SLRNSARGLKMAASAARGAAALRRSIKQPVAFVRRI PWTAASSQ 

LKEHFAQFGJTVRRCILPPDKBTGFHRGI^l^^ 

QENHI IDGVK\raVHTRRPKIJPQTSDDEKKDF 


7140 


1401 


1957 


J^SI^VLKAHGGLIPSSFCKX^HTGQYAI^BLFDLKVYDCFCS F 
NMNVSIJSKQLRPSQPWPRGKCRKTPGWEEARJPKAQDLRGDLGKT 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, (^Cysteine, D=*Aspartic Acid, B= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H»Histidine, I=Isoleucine, K=I»ysine, 
l»=Leucine, M^Methionine, N=Asparagine, 
P~Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threoniue, V«=Valine, 
W-Tryptophan, Y -Tyrosine, X= Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QAGPAEAHTRGPPIUjPAATGCPPHLPGLLSG IS VD1DPTGLQSQ 
WTPKGQDPPI^FSEDYQKSLI^EQYHLGLDQKIiRlCYVVGEIilWKF 
ADFMTNQCG 


7141 


124 


1073 


U>SRSCMLDMEDLEEDVRFIVDBTIjDFGGLSPSDSREEBDITVL 
VTPERPI^GI^HRSDPNAVAPAPO^VRLSI^PLSPEKLEEILD 
RANRLAAQLEQCAIiQDRESAGEGL 

VRDLLPTVNSI»TRSTPS/LKQPDASTPB***BGVSQGSPGYIWK 
EALQHEEGVTHIiQSVPCIQKPSIFSS\SRSTPPVRGRAGPSGRA 
AASEEl'RAAKLRGAAAKSSCQLP I PSAI PRPASRMPLTSRS VPP 
GRGALPPDSLSTRKGLPRPSTAGHRVRESGHKVPVSQRLNLPVM 
GATRSNLQPP 


7142 


658 




LIFI^IJIMELKMLSSVTIilllRAFLYWICLKPTSCLIFQNVLNLL 
KX*SRAVGVVVVMCRT/ySSDLQVGVIKPWLIjIjGSQDAAHDLDT 
LKKNKVTHIIJ^YGVBNAFLSDFTYKSISII^DLPETNIIiSYPP 
ECFEF IEEAKRKDGVVLVHCNA 


7143 


3 " 


773 


SI*EMSSDGRPIjSRMDSEDSISSTIMDVDSTISSGRSTPAMMNGQ 
GSTTSSSKNIAYNCCWDQCOACFNSSPDLADHIRSIHVDGQRGG 
VFVCLWKGCKVYNTPSTSQSWI^RHMLrasGDKPFKCVVGGCNA 
SFASOGGLARIIVPTHFSQOJNSSKVSSQPKAKEESPSKAGMNKRR 
KLKNKRRRSLARPHDFFDAQTliDAI RIIRAICFNLSAHI ES LGKO 
HSWFHSTVSIIiFFQIKYKTLQKNISTI ISKSLKI 


7144 


1 


988 


FRVNMQ1X3GPSPAEHSKAEESAGMEARFIjGLPDAAGSSGPTPAR 

RCPAPRPAGVSWIRDEVEKYNHNGVNALQLDPALNRLFTAGRD 

SIIRIWSVNQHKQDPYIASMEHHTDWVND1VLCCNGKTLISASS 

DTTVKVWNAHKGFCMSTI»RTHKDYVKAIiAYAKDKE 

QIFI»WDVNTLTALTASNNTVTTSSLSGNI03SIYSLAMMQIjGTI I 

VSGSTEKVLRVWDPRTO^KLMKIiK^ 

GSSDGTIRLWSIjGQ£RCIA1TRVHDEGVWALQVN^ 

RDRKIYCTDLRNPDIRVLICE 



TRADOCS:l416260.l(yrfSK01!.DOC) 
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WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:M786 and 3573-5358, a mature protein coding portion 
of SEQ ID NO:l-1786 and 3573-5358, an active domain of SEQ ID NO:M786 and 
3573-5358, and complementary sequences thereof 

2. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent 
hybridization conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide has greater than about 90% sequence identity with the 
polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1. 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell 

10. An isolated polypeptide, wherein the polypeptide is selected from the group 
consisting of: 
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(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one ofSEQ ID NO: 1-1786 and 3573-5358. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 

♦ 13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim I for a period sufficient to form the complex; 
and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions 
with nucleic acid primers that anneal to the polynucleotide of claim 1 under such 
conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in 

the sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

16. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: 
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a) contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation 
is detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound 
complex is detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a 
cell, under conditions sufficient to form a polypeptide/compound complex, wherein the 
complex drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence 
expression, so that if the polypeptide/compound complex is detected, a compound that 
binds to the polypeptide of claim 1 0 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of a polynucleotide sequence of SBQ ID NO:l-1786 and 3573- 
5358, a mature protein coding portion of SEQ ID NO:l-1786 and 3573-5358, an active 
domain of SEQ ID NO: 1-1 786 and 3573-5358, complementary sequences thereof and a 
polynucleotide sequence hybridizing under stringent conditions to SEQ ID NO: 1-1 786 
and 3573-5358, under conditions sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 
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20. An isolated polypeptide comprising an amino acid sequence selected from the i 
group consisting of any one of the polypeptides SEQ ID NO:1787 -3572 and 5359-7144, 
the mature protein portion thereof, or the active domain thereof. 

2L The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO:l-1786 and 3573-5358. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid 
array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of 
the polynucleotides in the collection, 

25. The collection of claim 23, wherein the array detects mismatches to any one of 
the polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer- 
readable format. 

27. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 1 0 or 20 
and a pharmaceutical^ acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that specifically 
binds to a polypeptide of claim 10 or 20 and a pharmaceutical^ acceptable carrier. 
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